Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2828 |
Symbol | |
ID | 3705577 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 3204298 |
End bp | 3206172 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637739304 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_344805 |
Protein GI | 77166280 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.177049 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCCA TTCCGGAAGA ATTTCTGTCC ACCCAAGCCC AGGTGGATGA GCAAGCCATC CAACCCTTTC CCAATTCACG CAAAATCCAT GTCGCGGGTA GTCGTCCCGA CATTCGTGTC CCCATGCGAG AAATCACGCT TTCCGACACC CATACGAGTC AGGGGCGAGA GAAAAACCCC CCCTTAACTG TCTATGATAC CTCGGGTCCT TATACGGACC CGGAAGCCAA AATTGATATC CGCCAAGGCC TGTCGGAACT CCGTCGAAAC TGGATTGAAG AACGGGCAGA TACCGAAATA TTATCTGATC TTTCCTCCCA ATACGGCCGT CAGAGAAACG CTGACTCCAA GCTGGATTCC TTACGTTTTG CCCATCTACG CCCACCCCGC CGAGCCAAAG CTGGGCATAA TGTGAGCCAG ATGCACTACG CTCGACAAGG AATCATCACC CCAGAAATGG AATTTATTGC TATCCGCGAA AACCAGCGCT TGGAGCAATA CCGGGAGCAG CTTGCCCAGC ACCATCCCGG CCAGTCATTC GGTGCCCATC TACCGTCCCG GCTAACCCCC GAATTCGTAC GCTCGGAAGT CGCTCGCGGC CGCGCCATTA TCCCAGCCAA TATCAACCAT ACCGAGTTGG AGCCCATGAT CATCGGGCGT AACTTCTTGG TCAAGATTAA TGCCAATATT GGCAACTCCG CCGTAACCTC TAGCATTGCC GAGGAAGTGG ATAAAATGAC CTGGGCCATC CGTTGGGGTG CCGACACGGT GATGGATCTT TCCACTGGCA AAAATATCCA CGAAACACGG GAATGGATAG TACGCAACTC TCCCGTCCCC ATTGGCACCG TGCCCATTTA CCAGGCCCTC GAAAAAGTGG GGGGAAAAGC CGAAGAGCTG ACCTGGGAGA TCTTCCGCGA CACCTTAATT GAACAAGCCG AACAGGGCGT GGATTATTTC ACCATTCACG CCGGGGTGCG CCTGGCCTAT GTGCCTTTAA CCGCTAAGCG GCTAACCGGT ATCGTCTCCC GGGGCGGCTC TATCATGGCC AAATGGTGTC TTGCCCATCA CACGGAAAGT TTCCTCTACA CTCATTTTGA GGAAATTTGC GAAATTATGA AAGCTTACGA TGTTTCTTTC TCCTTAGGAG ATGGCCTGCG GCCTGGCTCT CTTGCCGATG CCAACGATGC AGCTCAATTT GCCGAGCTTG AAACCCTGGG AGAACTTACC GAAATCGCTT GGAAACATGA TGTCCAAACC ATGATCGAAG GCCCAGGCCA TGTCCCCATG CATCTTATTA AGGAAAATAT GGACAAACAG TTGGCTTGTT GTGGCGAGGC GCCTTTCTAC ACCTTGGGAC CTTTAACCAC CGACATTGCG CCAGGCTACG ACCACATCAC CTCTGGCATC GGCGCGGCTA TGATTGGCTG GTATGGCACT GCCATGCTCT GTTACGTCAC CCCCAAGGAG CACCTAGGAT TACCCAATAA AAATGATGTC AAGGAAGGCA TTATTACTTA TAAAATCGCG GCCCACGCAG CTGACCTAGC CAAAGGCCAC CCCAGCGCCC AAATCAGAGA TAATGCCATG TCCAAAGCAA GATTTGAGTT TCGCTGGGAA GATCAGTTCA ACATTGGTTT AGATCCTGAT CAGGCACGGG AGTATCACGA TGAGACCTTG CCCAAAGACT CAGCCAAAGT AGCCCACTTC TGTTCCATGT GCGGTCCTCA ATTTTGCTCC ATGAAGATCT CCCAGGACGT GCGCGAATAT GCTAAACAGA AGGGTCTAAA GCACCATACC GCTCTGGAAC AAGGTATGGC GGAAAAAGCC CAGGAATTTA GGGAAAAGGG TGCGGAGATT TACCACGAGA CCTGA
|
Protein sequence | MSAIPEEFLS TQAQVDEQAI QPFPNSRKIH VAGSRPDIRV PMREITLSDT HTSQGREKNP PLTVYDTSGP YTDPEAKIDI RQGLSELRRN WIEERADTEI LSDLSSQYGR QRNADSKLDS LRFAHLRPPR RAKAGHNVSQ MHYARQGIIT PEMEFIAIRE NQRLEQYREQ LAQHHPGQSF GAHLPSRLTP EFVRSEVARG RAIIPANINH TELEPMIIGR NFLVKINANI GNSAVTSSIA EEVDKMTWAI RWGADTVMDL STGKNIHETR EWIVRNSPVP IGTVPIYQAL EKVGGKAEEL TWEIFRDTLI EQAEQGVDYF TIHAGVRLAY VPLTAKRLTG IVSRGGSIMA KWCLAHHTES FLYTHFEEIC EIMKAYDVSF SLGDGLRPGS LADANDAAQF AELETLGELT EIAWKHDVQT MIEGPGHVPM HLIKENMDKQ LACCGEAPFY TLGPLTTDIA PGYDHITSGI GAAMIGWYGT AMLCYVTPKE HLGLPNKNDV KEGIITYKIA AHAADLAKGH PSAQIRDNAM SKARFEFRWE DQFNIGLDPD QAREYHDETL PKDSAKVAHF CSMCGPQFCS MKISQDVREY AKQKGLKHHT ALEQGMAEKA QEFREKGAEI YHET
|
| |