Gene NATL1_14871 details

Gene Information

Locus tag: NATL1_14871
Symbol: bioB
ID: 4779672
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1197337
End bp: 1198374
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 40%
IMG OID: 640084768
Product: biotin synthase
Protein accession: YP_001015309
Protein GI: 124026193
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0502] Biotin synthase and related enzymes
TIGRFAM ID: [TIGR00433] biotin synthetase
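
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1197337
end_bp = 1198374

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not encode a residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the reported 1038 bp and 345 aa.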


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.598492
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACTCTAA TTAATCCTAA TATCCAGGAA TCTAATAAAC TCAAGTTCAA AGACGAATCA 
TATTTAGATT TTAATTCCAT AAATGGTGGA GATATTAGAC ATGATTGGTC TTCAGAAGAA
ATCAAAGAAA TACTTGATTT GCCGTTAATG GATTTGTTGT GGAGAGCTCA AATAGTTCAT
AGGTCTTACA ATCCCGGTTA TAAAGTTCAG CTTGCTTCTC TTCTAAGTGT GAAGACAGGT
GGATGCTCAG AAGACTGTGC ATATTGTCCT CAATCTGTTC ACAATGAAAC AACTGTTCAA
CCTAATCCTG TAATTGAAGT TGAGTCAGTT CTTGATAGAG CAAGAGCTGC AAAAGATGCA
GGAGCAGATA GATTTTGCAT GGGTTGGGCT TGGCGTGAGA TCAAAGACGG AAACGCATTC
GATTCAATGC TTGAAATGGT AAGAGGTGTT AGAGAGCTTG GCCTTGAGGC ATGTGTCACC
GCTGGAATGA TTACTGATTC TCAAGCCTCT AGATTGGCAG AAGCAGGTTT AACAGCCTAT
AACCATAATT TAGATACTAG TCCTGAGCAT TATTCCAAAA TCATTTCAAC AAGAACATAT
CAAGATCGAC TTGAAACATT GAGAAGAGTA CGCATGGCTG GAATTACAGT GTGCTGTGGT
GGGATTATTG GCATGGGGGA ATCTGTTTCA GATAGAGCAT CTTTACTTAA GGTTTTAGCA
ACTTTAGACC CGCATCCTGA AAGTGTACCT ATTAATGCGT TGGTTGCAGT GGAGGGGACA
CCCATGGAGG ATTTGTCTTC TATCGATCCA TTAGAGATGG TTCGTATGGT CGCGACGGCA
AGGGTTATTA TGCCTAAAAG CCGAATAAGA CTTAGCGCAG GGAGACAACA ATTAGGTAGG
GAAGCTCAGA TTCTATGTCT ACAATCTGGA GCTGATTCTA TATTTTATGG AGATACACTT
TTAACTACAA GCAATCCGGA GGTGGAAGCA GACCGTAAGC TTTTAGCGGA TGCTGGAATT
ACGGCTAATT TCTCTTAA
 
Protein sequence
MTLINPNIQE SNKLKFKDES YLDFNSINGG DIRHDWSSEE IKEILDLPLM DLLWRAQIVH 
RSYNPGYKVQ LASLLSVKTG GCSEDCAYCP QSVHNETTVQ PNPVIEVESV LDRARAAKDA
GADRFCMGWA WREIKDGNAF DSMLEMVRGV RELGLEACVT AGMITDSQAS RLAEAGLTAY
NHNLDTSPEH YSKIISTRTY QDRLETLRRV RMAGITVCCG GIIGMGESVS DRASLLKVLA
TLDPHPESVP INALVAVEGT PMEDLSSIDP LEMVRMVATA RVIMPKSRIR LSAGRQQLGR
EAQILCLQSG ADSIFYGDTL LTTSNPEVEA DRKLLADAGI TANFS
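
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, not the method the database used. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it allows. It does not handle alternative start codons, which is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional T, C, A, G order.
# These are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
first_30 = "ATGACTCTAA TTAATCCTAA TATCCAGGAA"
last_18 = "ACGGCTAATT TCTCTTAA"

print(translate(first_30))  # MTLINPNIQE
print(translate(last_18))   # TANFS
```

The two fragments translate to the first ten and last five residues of the protein sequence listed above, which is consistent with the sequence being shown in its coding orientation.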