Gene Nmul_A2384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2384 
Symbol 
ID3784975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2715871 
End bp2717439 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content60% 
IMG OID637812473 
Producthypothetical protein 
Protein accessionYP_413065 
Protein GI82703499 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCGC TCAAGAACAA TCTATCCACT GCCATTTACT CCACGGCTGA GATCAGAGAA 
ATCGAACACC TGGCGGCCGG CCTGCCGGGC CGGCCTCAAT TGATGGAAGC AGCCGGGCTC
GCCGCCGCTG AAATTGCCCG TGACCGGCTG CTCTCCCCCC ATAAACCCCG CTTGCTGGTA
CTGGCGGGGC CCGGGAACAA CGGGGGCGAT GCCTTTGTCG CGGCGCGCCA CCTGCGGGAG
TGGGGGTTCA AGGTAACGCT GGTGTTTACC GGCGAGCAGG CAAAGTTATC GGCGGATGCG
CTGCGGGCGC TGAATGCCTG GGTTGCCACC GGCGCTGGGA TGGTCTCCGA AATTCCGGAA
AATGAAACAT GGGATGCCGT GATCGACGGC TTGTTCGGTA TTGGTCTGGA CCAGCAAGGC
GGACGAGAAC TGGGCGGCAA ATATCTGGCT ATGGTGAATA CCGTTAACGC CATGATGCTG
CCGGTGCTGT CCATCGATAT TCCCAGCGGT CTGGGCAGCG ATACCGGCGC TGTGTGCGGC
GCGGCTATCA TCGCGACCAT GACGGCGACA TTCATTGGCC TGAAACCCGG CCTCTTCACG
AATGAGGGCC CCGATTATTG CGGGAAAGTT TTTTTGCACG ACCTTGATCT CGATGTCTCA
TCCCTGAAGA AACCCGATGC GTGGCTGATG GACCAGATGC ACATCCGGCG GCTCCTGCCC
CCTCCCCGGC GGGCTAACAG TCACAAGGGC ATGTTCGGCA GTATCGGGGT AATCGGAGGA
ACAGCCGGCA TGGTCGGGGC TGCGCTTCTG GCGGGTACAG CGGCGCTGAA ACTGGGTGCC
GGACGCGTTT ACCTGGGATT GATGGCGCCG GACGCGCCAG CGGTCGACAC CTTCCAGCCG
GAACTGATGC TGCGCCCTAT CCAGGATCTG TTCAAGCTGG AGCAGCTGAA CTGTCTGGTG
GTTGGTCCGG GTTTGGGAAC GGAAACCGCC GCATACTTCT GGCTCAAGTG CGCGCTGCAA
ACCACTCTAC CACTGGTTCT CGATGCAGAT GGCTTGAATC TCGTTGCCTC GCATTCCGAA
ATAGCGGGAT TGCTGCGGGA ACGGTTGCGC GAACGCCATG CGCCTTCCAT TCTCACTCCT
CATCCGGCCG AAGCCGCCCG CCTCCTGAAA AGCACCACGA CTTCCGTACA ACAGGACCGG
ATGGCAGCCG CTGCGGAGCT GGCCCAGCGT TTTAACTGCT GGATTGTGTT GAAAGGCGCA
GGCAGCGTGT GCGCGATGCC GGAGGGCAGA CGCTTCATCA ACACAAGTGG AAATCCCGGC
TTAAGCAGCG CAGGAACGGG CGATATCCTC TCCGGGATGA TTGGCGCCTT TCTGGCACAG
CGATCGAGCC CGGAAAACGC GCTGCTCGCT GCTGTATACC TGCACGGAGC GGCTGCCGAC
GTATTGCAGA AGCAGTATGG CGGCGGCATT GGAATGACCG CCTCCGAAAT TCCCAACGTC
GCCCGTAACC TGTTGAACCA ATGGATTGCC GTTAATTCTG CCCCGGCCCC TCACCAGCAA
GAAGGCTGA
 
Protein sequence
MNALKNNLST AIYSTAEIRE IEHLAAGLPG RPQLMEAAGL AAAEIARDRL LSPHKPRLLV 
LAGPGNNGGD AFVAARHLRE WGFKVTLVFT GEQAKLSADA LRALNAWVAT GAGMVSEIPE
NETWDAVIDG LFGIGLDQQG GRELGGKYLA MVNTVNAMML PVLSIDIPSG LGSDTGAVCG
AAIIATMTAT FIGLKPGLFT NEGPDYCGKV FLHDLDLDVS SLKKPDAWLM DQMHIRRLLP
PPRRANSHKG MFGSIGVIGG TAGMVGAALL AGTAALKLGA GRVYLGLMAP DAPAVDTFQP
ELMLRPIQDL FKLEQLNCLV VGPGLGTETA AYFWLKCALQ TTLPLVLDAD GLNLVASHSE
IAGLLRERLR ERHAPSILTP HPAEAARLLK STTTSVQQDR MAAAAELAQR FNCWIVLKGA
GSVCAMPEGR RFINTSGNPG LSSAGTGDIL SGMIGAFLAQ RSSPENALLA AVYLHGAAAD
VLQKQYGGGI GMTASEIPNV ARNLLNQWIA VNSAPAPHQQ EG