Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nham_2000 |
Symbol | |
ID | 4031553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter hamburgensis X14 |
Kingdom | Bacteria |
Replicon accession | NC_007964 |
Strand | + |
Start bp | 2222476 |
End bp | 2224869 |
Gene Length | 2394 bp |
Protein Length | 797 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637970457 |
Product | hypothetical protein |
Protein accession | YP_577259 |
Protein GI | 92117530 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGCGGC GCAAAGGCGG TTCTGCAGCA GGGATGGGTC TGCTCATCGT TCTGGGTCTG ATCTTTGCGG CCTTGAGCGC GGTCTATCGA TTCATTGTCG AAAATCTGGC GGCGATCACA GGGTTCGGCA CCGTCGTGGG ATTGCTGCTG ATTCTGTGGT ATCTATTCTC GAAGCTGGAA GGCCAAAAGT CGTCCGTCGC AGAGCGGCCA TTAGCTACTC AACAACCGCA AATCCAGCTT CTCAGCCAGC GCGGATACGT CGAGCCTCTA GTGCAGTTCG GGTCCCGGGC CAGGTCCGCA GGGACGTCCG GCCAGACCAG GTGGGTAGCC CCGACCGAAA CGATCATCGT CCAAGGCGTA TCCGTTGCCG GAGGGATGTT CTATCTCGGC ACCGCCCTTT CTTTCGAGGG ACGGGAAATC GACGAATACG TCGTCAATCC CGAGCTTTCG GCCAAGTCAT CCAGACCGGA CGTCGAGGGC ACTTCAATGC CCTACTGGCC GTCCTATGCG GAGGTGGCAC CGGCGGCCAG GCGGGCGTTC CTTGAGTGGA TGTCGACGGG CAGGCGGGCC AAGCCATATG GAATCGGGCA CGTCTTTCTG TATTTCTACG GTCTGGAGCA CCGAGTATTT GCGGACCGGG ACGTAGGAAA CGTGCCCGCT TTGATTGCGG AGGTGGACCA GCTCCTTAGG GTCTACGGCG ATAACGGCTC TTTTCATGGT TACGCAACGC ACTTCCTCGA TCTCGCGCGC TGCGCGGCCG GCTTGCCGCT TCCCGTTCCC GAGCCATCTG CCGACACAGC TTATGCCGCC GAAATGGGTA CGGCGGTCCG CCTCCACCTT GGGCGGCGCC TGGCACAATC GGCCGCGATT CTGTCCGAAG ACGCGCTGAT CTGGGTTCTA GCCTTGCCCG ACGTCTATCT CCGGACCGCG GCGGTCCGGT GCTTCGATAA ATTCGTGCCC CTGTGGCATC TCCGTTTCCG GAGCAGGTTT CCGGAAGGAC TGAGGGTCGC GACGTCAGGC AACATCGACT TGAGCTACCG GGCTGCAAGC GGTGCATTCG AAGTGCCGGT TGATGGGCCC CATCGCGATT ATCCGGACGT GACGAAAGTG AAGACTTCTC TCGAACCGTT GAGACAGCTT GTGCAGGAAT GTACGGACGA ACTCGATGGA TTCAGCCGAT TGCTAGGCCG ACGTCCCGAA GCTCGAAACT CCGTCCAAGC CGCGCTATTG CTTCCGGAGG ATCTGTTGGC CGAAACCGTT TTCGAGGCTG TGCGCGAGTT CGGACAGCGG CTCTCGGAGA TCATGGGCGG CAAGCAACTC GCCAGCACGA AGATGGATAC GGTGCTTCGG TTGGCCAATT TCGAGTTGCC AGACAGCGGC AAGTTGTCGC CCGCGGTCGC CGACCAACTG GGCCAAGTGC TCGATCGCCT CGACATCGCA ATCGAACCGG ACCGGCGTTA CGGGGGAGGG GTCCCGCAGC CGCAAGACCA GGTGTTTCTG TTCAACGCCC CCGGGGGAGG CCCCGTGGAT TCGGAGCGGC CCGCTTACCG ATCGATGAAG GCGCAGGTCG AAGTTGCAGT GCTGGCGGCT GCGGCGGACG GGGAAGCTTC CGGCGAGGAG ATTCAGCGCG TCATCGCCGG CATCAAGGAA GGTGTGGACC TTGGCGGAGT CGAGAGGGCC AGGCTGATCT CGTTCGCCAT CACAATTTTC AACAGCCCGC CGAAACAAGC GAGGGTCCTG AAGCGGTTGG CCGACAGAAG CCCTGCAGAG CGCGCAACAA TTGCGAAAGC CGCCGTAGCC ATCGTCGTCG GCGATGGGAC GGTCCAGCCT GACGAGGTCA GGTTTCTTGA GAAGCTCCAC AAGGCGCTGG GTCTTCCGAA GGAACAGCTC TACTCCGAAC TGCACAAGGC AGTGCCGAGG TCGGACGAGC CGGTCGCGAT CTCGATCGAG CAGCGTCAGG CGGGGATTCC CATTCCCAAG GAAGCGCCGG TCCCCACGCC GGACGCCGTC GTACGCATCA GGATCGACGC CGAACGCCTC GCGCGTGCCC AGCGGGAAAC GGCAGAAGTC TCCGAGCTCC TGGCTAACAT ATTCGAGGAG GAGACCCCGC CTCCCGTCGA GACCGTCGCT GCCGTTGCCA ATGCATCAGC TTTCGAAGGA CTTGACCAGT CGCACACCGA ACTGGTCGAA CTCATCGAGC TCAAAGGCGC GGTCCCGAAG CTGGAATTCG AGGAGCGGGC CCGCGCGATG AAGCTGCTGG CGGAAGGCGC CCTCGAGCGC ATAAACGACT GGGCCTTCGA GCGCTTCGAC GAGGCTCTGC TCGAAGACGG TGATGAAATC GTGATGGCAC CGCACTTGCG GGAAAGGCTG TCCGAATTGA GAGAGACGGC ATGA
|
Protein sequence | MGRRKGGSAA GMGLLIVLGL IFAALSAVYR FIVENLAAIT GFGTVVGLLL ILWYLFSKLE GQKSSVAERP LATQQPQIQL LSQRGYVEPL VQFGSRARSA GTSGQTRWVA PTETIIVQGV SVAGGMFYLG TALSFEGREI DEYVVNPELS AKSSRPDVEG TSMPYWPSYA EVAPAARRAF LEWMSTGRRA KPYGIGHVFL YFYGLEHRVF ADRDVGNVPA LIAEVDQLLR VYGDNGSFHG YATHFLDLAR CAAGLPLPVP EPSADTAYAA EMGTAVRLHL GRRLAQSAAI LSEDALIWVL ALPDVYLRTA AVRCFDKFVP LWHLRFRSRF PEGLRVATSG NIDLSYRAAS GAFEVPVDGP HRDYPDVTKV KTSLEPLRQL VQECTDELDG FSRLLGRRPE ARNSVQAALL LPEDLLAETV FEAVREFGQR LSEIMGGKQL ASTKMDTVLR LANFELPDSG KLSPAVADQL GQVLDRLDIA IEPDRRYGGG VPQPQDQVFL FNAPGGGPVD SERPAYRSMK AQVEVAVLAA AADGEASGEE IQRVIAGIKE GVDLGGVERA RLISFAITIF NSPPKQARVL KRLADRSPAE RATIAKAAVA IVVGDGTVQP DEVRFLEKLH KALGLPKEQL YSELHKAVPR SDEPVAISIE QRQAGIPIPK EAPVPTPDAV VRIRIDAERL ARAQRETAEV SELLANIFEE ETPPPVETVA AVANASAFEG LDQSHTELVE LIELKGAVPK LEFEERARAM KLLAEGALER INDWAFERFD EALLEDGDEI VMAPHLRERL SELRETA
|
| |