Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nham_1041 |
Symbol | |
ID | 4031647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter hamburgensis X14 |
Kingdom | Bacteria |
Replicon accession | NC_007964 |
Strand | + |
Start bp | 1162858 |
End bp | 1165065 |
Gene Length | 2208 bp |
Protein Length | 735 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637969539 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_576349 |
Protein GI | 92116620 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.453429 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTCTT ATATCGGTAC AGCCACGCCC CGCGTCGATG GGCGCGACAA GGTCACCGGC GCCGCACGCT ACGCCGGTGA GCACGGCGCA CCCGACCTTG TGCATGGCAG CGTGGTCACC TCGACCATCG CCAAAGGTCG GATCAGGCGG ATCGATACCA GCGAAGCTGC GAAGGTCGAT GGCGTCATCG CCGTCCTCAC CCACGAGAAT CGGCCGCCGA TGGCAAATAA CGACAAGGCC TACAAGGACG ACGTCGCACC CGAGCAAGGC TCACCCTATC GTCCGCTCTA CGACGGCGAG ATCAGGTTTA ACGGCCAGCC GGTCGCGCTG GTGCTGGCAG AGGATTTCGA GACCGCAACC TTTGCCGCCT CCCTCGTCCG CGTCGAATAT GACGAGCAGC CACACATCAC CGATATCGAA CGGCAGCGGG GCAACGCGGT TCCGCTCGAT GCGCCGGCCA AACCTCGCGG CGATGCCGCG CAAGCCTTTG CGGCTTCGGA TATCCGGCAT GATGCCGAAT ACTATGTGCC GGTCGAGCAT CACAATCCAA TGGAGTTGTT TGCCTCAACA GCGGTGTGGC GCGACGGCAA GCTCACGGTC TACGACAAAA CCCAGGGCGT GCAGAACGTT CAGCGCTATC TATGCGGCGT GTTCGAGGCC AGGTCCGACG AAATCCAGGT GATGTCGCCC TATATGGGCG GCGGCTTTGG TTCCGGCCTG CGACCGCAGT TTCAGGTCGT GCTGGCCGTG CTCGGCGCCC GTGCGCTCAA ACGCTCGGTT CGCGTCGTGC TGACGCGACA GCAGATGTAC GAGGTTGGCT ATCGCCCGGC GATGATCCAG CGCATCCAGC TTGGCGCGAA GCCGGACGGC ACGCTGAATG CGATCATCCA TGATGCCACC ACTACTACCT CGCAATATGA GGACTTCCAC CGCAACGAGA CCACATGGTC CGGCCTGCTC TACAAGAGCG AGACGGCAAG CTACGCGCAC AAACTCGCGC ATCTCGATCT GCCGACGTCG TGCGACATGC GCGCGCCAAG TGCTGCGACC GGTGTCTATG CGCTGGAAGC GGCGATGGAC GAACTTGCGG TCGCACTGAA GATGGACCCG CTCGAACTGC GGCTGAAATG CTATTCCGAT CGCGATCAGA TCACCGGCTT GCCCTTCAGC AGCAAAAGCT TGCGCGAATG CTACAGCCAA GGCGCGGCGG CGTTCGGCTG GAACAAGCGC AATCTCGCAC CACGCTCGAT GCGCGACGGT AACGACCTGA TTGGCTGGGG CATGGCCACC GGCATCTGGG AAGCGCTGCA GGTCCCGATC ACCGTGCGGA TCACGTTGAC GGCCAACGGC CACGCGGAAG TTGCGTGCGC GACATCCGAT ATCGGCACCG GCACCTATAC GATCATGGCG CAGGTCGCCG CCGACATGCT CGGCCTGCCG ATCGACAACG TCACCGTCAA GCTCGGCGAC TCGACGCTGC CGCAATCCCC GGTTGAAGGA GGGTCATGGA TTGCCGCGTC GGTCTCGAAC GGCATCCTCA CCACCAGCAA CGCTATCCGC GACGAACTGT TGCGGCAGGC GCAGACAATG CCGGATTCGC CGCTGAAGGG CGCGACGGCT GACGACGTGG CGCTGGCCGA TGGCAGCATT ATCTCCAAAA AGCCGCCGCG CAGCACGATT CCGATTGCGG ACGTCATGCG GCACGCCGGC GTCGACCGTA TCGCGCAGGA AAAGGCCACG CAGTTCCAGA ACGACGGCAA ACACGCTCAC AACGCCCATT CGGCGATCTT CGCCGAGGTG AAGGTGGACG AGCAGTTGGG AGTGATACGC GTGACTCGCC TCGTCAGTGC GGTCGCTGCG GGACGCATCC TCAACCTCAA GACCGCGCGC AGTCAGGTAA TGGGCGGCAT GATCTGGGGC ATCGGTATGG CGCTGCATGA GGAAACGCTG ATCGACCACC GCTTCGGCCG GATCATGAAT GCCAATATCG CCGAATATCA TGTGCCGGTG AATGCCGACG TCCACGACGT CGAGGTGATC TTCGTCGACG AGCAGGACGA CATCGTCAAC CCGATGGGCA TCAAGGGGTT GGGCGAGATT GGCATCGTCG GCGTTCCCGC GGCGATTGCC AACGCGGTCC ATCATGCCAC AGGCAAACGG GTGCGAGATC TCCCGATCAC TCTCGACAAG TTAACACGGG ACACCTAG
|
Protein sequence | MTSYIGTATP RVDGRDKVTG AARYAGEHGA PDLVHGSVVT STIAKGRIRR IDTSEAAKVD GVIAVLTHEN RPPMANNDKA YKDDVAPEQG SPYRPLYDGE IRFNGQPVAL VLAEDFETAT FAASLVRVEY DEQPHITDIE RQRGNAVPLD APAKPRGDAA QAFAASDIRH DAEYYVPVEH HNPMELFAST AVWRDGKLTV YDKTQGVQNV QRYLCGVFEA RSDEIQVMSP YMGGGFGSGL RPQFQVVLAV LGARALKRSV RVVLTRQQMY EVGYRPAMIQ RIQLGAKPDG TLNAIIHDAT TTTSQYEDFH RNETTWSGLL YKSETASYAH KLAHLDLPTS CDMRAPSAAT GVYALEAAMD ELAVALKMDP LELRLKCYSD RDQITGLPFS SKSLRECYSQ GAAAFGWNKR NLAPRSMRDG NDLIGWGMAT GIWEALQVPI TVRITLTANG HAEVACATSD IGTGTYTIMA QVAADMLGLP IDNVTVKLGD STLPQSPVEG GSWIAASVSN GILTTSNAIR DELLRQAQTM PDSPLKGATA DDVALADGSI ISKKPPRSTI PIADVMRHAG VDRIAQEKAT QFQNDGKHAH NAHSAIFAEV KVDEQLGVIR VTRLVSAVAA GRILNLKTAR SQVMGGMIWG IGMALHEETL IDHRFGRIMN ANIAEYHVPV NADVHDVEVI FVDEQDDIVN PMGIKGLGEI GIVGVPAAIA NAVHHATGKR VRDLPITLDK LTRDT
|
| |