Gene Noc_2828 details


Gene Information

Locus tag: Noc_2828
Symbol:
ID: 3705577
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 3204298
End bp: 3206172
Gene Length: 1875 bp
Protein Length: 624 aa
Translation table: 11
GC content: 52%
IMG OID: 637739304
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_344805
Protein GI: 77166280
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
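
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length, assuming one trailing stop codon is excluded."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(3204298, 3206172)
print(length)                     # 1875
print(protein_length_aa(length))  # 624
```

Both results agree with the Gene Length and Protein Length values in the record.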


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.177049
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGCCA TTCCGGAAGA ATTTCTGTCC ACCCAAGCCC AGGTGGATGA GCAAGCCATC 
CAACCCTTTC CCAATTCACG CAAAATCCAT GTCGCGGGTA GTCGTCCCGA CATTCGTGTC
CCCATGCGAG AAATCACGCT TTCCGACACC CATACGAGTC AGGGGCGAGA GAAAAACCCC
CCCTTAACTG TCTATGATAC CTCGGGTCCT TATACGGACC CGGAAGCCAA AATTGATATC
CGCCAAGGCC TGTCGGAACT CCGTCGAAAC TGGATTGAAG AACGGGCAGA TACCGAAATA
TTATCTGATC TTTCCTCCCA ATACGGCCGT CAGAGAAACG CTGACTCCAA GCTGGATTCC
TTACGTTTTG CCCATCTACG CCCACCCCGC CGAGCCAAAG CTGGGCATAA TGTGAGCCAG
ATGCACTACG CTCGACAAGG AATCATCACC CCAGAAATGG AATTTATTGC TATCCGCGAA
AACCAGCGCT TGGAGCAATA CCGGGAGCAG CTTGCCCAGC ACCATCCCGG CCAGTCATTC
GGTGCCCATC TACCGTCCCG GCTAACCCCC GAATTCGTAC GCTCGGAAGT CGCTCGCGGC
CGCGCCATTA TCCCAGCCAA TATCAACCAT ACCGAGTTGG AGCCCATGAT CATCGGGCGT
AACTTCTTGG TCAAGATTAA TGCCAATATT GGCAACTCCG CCGTAACCTC TAGCATTGCC
GAGGAAGTGG ATAAAATGAC CTGGGCCATC CGTTGGGGTG CCGACACGGT GATGGATCTT
TCCACTGGCA AAAATATCCA CGAAACACGG GAATGGATAG TACGCAACTC TCCCGTCCCC
ATTGGCACCG TGCCCATTTA CCAGGCCCTC GAAAAAGTGG GGGGAAAAGC CGAAGAGCTG
ACCTGGGAGA TCTTCCGCGA CACCTTAATT GAACAAGCCG AACAGGGCGT GGATTATTTC
ACCATTCACG CCGGGGTGCG CCTGGCCTAT GTGCCTTTAA CCGCTAAGCG GCTAACCGGT
ATCGTCTCCC GGGGCGGCTC TATCATGGCC AAATGGTGTC TTGCCCATCA CACGGAAAGT
TTCCTCTACA CTCATTTTGA GGAAATTTGC GAAATTATGA AAGCTTACGA TGTTTCTTTC
TCCTTAGGAG ATGGCCTGCG GCCTGGCTCT CTTGCCGATG CCAACGATGC AGCTCAATTT
GCCGAGCTTG AAACCCTGGG AGAACTTACC GAAATCGCTT GGAAACATGA TGTCCAAACC
ATGATCGAAG GCCCAGGCCA TGTCCCCATG CATCTTATTA AGGAAAATAT GGACAAACAG
TTGGCTTGTT GTGGCGAGGC GCCTTTCTAC ACCTTGGGAC CTTTAACCAC CGACATTGCG
CCAGGCTACG ACCACATCAC CTCTGGCATC GGCGCGGCTA TGATTGGCTG GTATGGCACT
GCCATGCTCT GTTACGTCAC CCCCAAGGAG CACCTAGGAT TACCCAATAA AAATGATGTC
AAGGAAGGCA TTATTACTTA TAAAATCGCG GCCCACGCAG CTGACCTAGC CAAAGGCCAC
CCCAGCGCCC AAATCAGAGA TAATGCCATG TCCAAAGCAA GATTTGAGTT TCGCTGGGAA
GATCAGTTCA ACATTGGTTT AGATCCTGAT CAGGCACGGG AGTATCACGA TGAGACCTTG
CCCAAAGACT CAGCCAAAGT AGCCCACTTC TGTTCCATGT GCGGTCCTCA ATTTTGCTCC
ATGAAGATCT CCCAGGACGT GCGCGAATAT GCTAAACAGA AGGGTCTAAA GCACCATACC
GCTCTGGAAC AAGGTATGGC GGAAAAAGCC CAGGAATTTA GGGAAAAGGG TGCGGAGATT
TACCACGAGA CCTGA
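
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to recompute a GC fraction for comparison with the GC content field in the record. The helper names are hypothetical, and the short input is the first two blocks of the sequence above, used for illustration only.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the gene sequence, as printed in the record.
example = "ATGAGCGCCA TTCCGGAAGA"
print(clean_sequence(example))
print(f"{gc_fraction(example):.0%}")
```

Pasting the full gene sequence into `gc_fraction` gives a value that can be compared with the reported GC content.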
 
Protein sequence
MSAIPEEFLS TQAQVDEQAI QPFPNSRKIH VAGSRPDIRV PMREITLSDT HTSQGREKNP 
PLTVYDTSGP YTDPEAKIDI RQGLSELRRN WIEERADTEI LSDLSSQYGR QRNADSKLDS
LRFAHLRPPR RAKAGHNVSQ MHYARQGIIT PEMEFIAIRE NQRLEQYREQ LAQHHPGQSF
GAHLPSRLTP EFVRSEVARG RAIIPANINH TELEPMIIGR NFLVKINANI GNSAVTSSIA
EEVDKMTWAI RWGADTVMDL STGKNIHETR EWIVRNSPVP IGTVPIYQAL EKVGGKAEEL
TWEIFRDTLI EQAEQGVDYF TIHAGVRLAY VPLTAKRLTG IVSRGGSIMA KWCLAHHTES
FLYTHFEEIC EIMKAYDVSF SLGDGLRPGS LADANDAAQF AELETLGELT EIAWKHDVQT
MIEGPGHVPM HLIKENMDKQ LACCGEAPFY TLGPLTTDIA PGYDHITSGI GAAMIGWYGT
AMLCYVTPKE HLGLPNKNDV KEGIITYKIA AHAADLAKGH PSAQIRDNAM SKARFEFRWE
DQFNIGLDPD QAREYHDETL PKDSAKVAHF CSMCGPQFCS MKISQDVREY AKQKGLKHHT
ALEQGMAEKA QEFREKGAEI YHET