Gene Nham_2000 details

Gene Information

Locus tag: Nham_2000
Symbol:
ID: 4031553
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter hamburgensis X14
Kingdom: Bacteria
Replicon accession: NC_007964
Strand:
Start bp: 2222476
End bp: 2224869
Gene Length: 2394 bp
Protein Length: 797 aa
Translation table: 11
GC content: 62%
IMG OID: 637970457
Product: hypothetical protein
Protein accession: YP_577259
Protein GI: 92117530
COG category:
COG ID:
TIGRFAM ID:
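
The start, end, and length fields above are consistent with each other, and a short check makes the arithmetic explicit. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of any database tool.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is excluded from the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
gene_len, protein_len = cds_lengths(2222476, 2224869)
print(gene_len, protein_len)
```

With the values in this record, the result agrees with the listed gene length (2394 bp) and protein length (797 aa).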


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGCGGC GCAAAGGCGG TTCTGCAGCA GGGATGGGTC TGCTCATCGT TCTGGGTCTG 
ATCTTTGCGG CCTTGAGCGC GGTCTATCGA TTCATTGTCG AAAATCTGGC GGCGATCACA
GGGTTCGGCA CCGTCGTGGG ATTGCTGCTG ATTCTGTGGT ATCTATTCTC GAAGCTGGAA
GGCCAAAAGT CGTCCGTCGC AGAGCGGCCA TTAGCTACTC AACAACCGCA AATCCAGCTT
CTCAGCCAGC GCGGATACGT CGAGCCTCTA GTGCAGTTCG GGTCCCGGGC CAGGTCCGCA
GGGACGTCCG GCCAGACCAG GTGGGTAGCC CCGACCGAAA CGATCATCGT CCAAGGCGTA
TCCGTTGCCG GAGGGATGTT CTATCTCGGC ACCGCCCTTT CTTTCGAGGG ACGGGAAATC
GACGAATACG TCGTCAATCC CGAGCTTTCG GCCAAGTCAT CCAGACCGGA CGTCGAGGGC
ACTTCAATGC CCTACTGGCC GTCCTATGCG GAGGTGGCAC CGGCGGCCAG GCGGGCGTTC
CTTGAGTGGA TGTCGACGGG CAGGCGGGCC AAGCCATATG GAATCGGGCA CGTCTTTCTG
TATTTCTACG GTCTGGAGCA CCGAGTATTT GCGGACCGGG ACGTAGGAAA CGTGCCCGCT
TTGATTGCGG AGGTGGACCA GCTCCTTAGG GTCTACGGCG ATAACGGCTC TTTTCATGGT
TACGCAACGC ACTTCCTCGA TCTCGCGCGC TGCGCGGCCG GCTTGCCGCT TCCCGTTCCC
GAGCCATCTG CCGACACAGC TTATGCCGCC GAAATGGGTA CGGCGGTCCG CCTCCACCTT
GGGCGGCGCC TGGCACAATC GGCCGCGATT CTGTCCGAAG ACGCGCTGAT CTGGGTTCTA
GCCTTGCCCG ACGTCTATCT CCGGACCGCG GCGGTCCGGT GCTTCGATAA ATTCGTGCCC
CTGTGGCATC TCCGTTTCCG GAGCAGGTTT CCGGAAGGAC TGAGGGTCGC GACGTCAGGC
AACATCGACT TGAGCTACCG GGCTGCAAGC GGTGCATTCG AAGTGCCGGT TGATGGGCCC
CATCGCGATT ATCCGGACGT GACGAAAGTG AAGACTTCTC TCGAACCGTT GAGACAGCTT
GTGCAGGAAT GTACGGACGA ACTCGATGGA TTCAGCCGAT TGCTAGGCCG ACGTCCCGAA
GCTCGAAACT CCGTCCAAGC CGCGCTATTG CTTCCGGAGG ATCTGTTGGC CGAAACCGTT
TTCGAGGCTG TGCGCGAGTT CGGACAGCGG CTCTCGGAGA TCATGGGCGG CAAGCAACTC
GCCAGCACGA AGATGGATAC GGTGCTTCGG TTGGCCAATT TCGAGTTGCC AGACAGCGGC
AAGTTGTCGC CCGCGGTCGC CGACCAACTG GGCCAAGTGC TCGATCGCCT CGACATCGCA
ATCGAACCGG ACCGGCGTTA CGGGGGAGGG GTCCCGCAGC CGCAAGACCA GGTGTTTCTG
TTCAACGCCC CCGGGGGAGG CCCCGTGGAT TCGGAGCGGC CCGCTTACCG ATCGATGAAG
GCGCAGGTCG AAGTTGCAGT GCTGGCGGCT GCGGCGGACG GGGAAGCTTC CGGCGAGGAG
ATTCAGCGCG TCATCGCCGG CATCAAGGAA GGTGTGGACC TTGGCGGAGT CGAGAGGGCC
AGGCTGATCT CGTTCGCCAT CACAATTTTC AACAGCCCGC CGAAACAAGC GAGGGTCCTG
AAGCGGTTGG CCGACAGAAG CCCTGCAGAG CGCGCAACAA TTGCGAAAGC CGCCGTAGCC
ATCGTCGTCG GCGATGGGAC GGTCCAGCCT GACGAGGTCA GGTTTCTTGA GAAGCTCCAC
AAGGCGCTGG GTCTTCCGAA GGAACAGCTC TACTCCGAAC TGCACAAGGC AGTGCCGAGG
TCGGACGAGC CGGTCGCGAT CTCGATCGAG CAGCGTCAGG CGGGGATTCC CATTCCCAAG
GAAGCGCCGG TCCCCACGCC GGACGCCGTC GTACGCATCA GGATCGACGC CGAACGCCTC
GCGCGTGCCC AGCGGGAAAC GGCAGAAGTC TCCGAGCTCC TGGCTAACAT ATTCGAGGAG
GAGACCCCGC CTCCCGTCGA GACCGTCGCT GCCGTTGCCA ATGCATCAGC TTTCGAAGGA
CTTGACCAGT CGCACACCGA ACTGGTCGAA CTCATCGAGC TCAAAGGCGC GGTCCCGAAG
CTGGAATTCG AGGAGCGGGC CCGCGCGATG AAGCTGCTGG CGGAAGGCGC CCTCGAGCGC
ATAAACGACT GGGCCTTCGA GCGCTTCGAC GAGGCTCTGC TCGAAGACGG TGATGAAATC
GTGATGGCAC CGCACTTGCG GGAAAGGCTG TCCGAATTGA GAGAGACGGC ATGA
 
Protein sequence
MGRRKGGSAA GMGLLIVLGL IFAALSAVYR FIVENLAAIT GFGTVVGLLL ILWYLFSKLE 
GQKSSVAERP LATQQPQIQL LSQRGYVEPL VQFGSRARSA GTSGQTRWVA PTETIIVQGV
SVAGGMFYLG TALSFEGREI DEYVVNPELS AKSSRPDVEG TSMPYWPSYA EVAPAARRAF
LEWMSTGRRA KPYGIGHVFL YFYGLEHRVF ADRDVGNVPA LIAEVDQLLR VYGDNGSFHG
YATHFLDLAR CAAGLPLPVP EPSADTAYAA EMGTAVRLHL GRRLAQSAAI LSEDALIWVL
ALPDVYLRTA AVRCFDKFVP LWHLRFRSRF PEGLRVATSG NIDLSYRAAS GAFEVPVDGP
HRDYPDVTKV KTSLEPLRQL VQECTDELDG FSRLLGRRPE ARNSVQAALL LPEDLLAETV
FEAVREFGQR LSEIMGGKQL ASTKMDTVLR LANFELPDSG KLSPAVADQL GQVLDRLDIA
IEPDRRYGGG VPQPQDQVFL FNAPGGGPVD SERPAYRSMK AQVEVAVLAA AADGEASGEE
IQRVIAGIKE GVDLGGVERA RLISFAITIF NSPPKQARVL KRLADRSPAE RATIAKAAVA
IVVGDGTVQP DEVRFLEKLH KALGLPKEQL YSELHKAVPR SDEPVAISIE QRQAGIPIPK
EAPVPTPDAV VRIRIDAERL ARAQRETAEV SELLANIFEE ETPPPVETVA AVANASAFEG
LDQSHTELVE LIELKGAVPK LEFEERARAM KLLAEGALER INDWAFERFD EALLEDGDEI
VMAPHLRERL SELRETA
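
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own method: it assumes the sequence is given 5' to 3' on the coding strand, strips the spaces used for 10-base grouping, and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in permitted start codons). The helper names `clean`, `translate`, and `gc_content` are hypothetical. Only the first and last lines of the gene sequence are used as input here.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq):
    """Remove whitespace used for grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence above (both start in frame).
head = "ATGGGGCGGC GCAAAGGCGG TTCTGCAGCA GGGATGGGTC TGCTCATCGT TCTGGGTCTG"
tail = "GTGATGGCAC CGCACTTGCG GGAAAGGCTG TCCGAATTGA GAGAGACGGC ATGA"
print(translate(head))
print(translate(tail))
```

The two fragments translate to the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_content` should give a value close to the 62% reported in the record.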