Gene PCC8801_0144 details

Gene Information

Locus tag: PCC8801_0144
Symbol:
ID: 7104730
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 149943
End bp: 151049
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 48%
IMG OID: 643473259
Product: biotin synthase
Protein accession: YP_002370406
Protein GI: 218245035
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0502] Biotin synthase and related enzymes
TIGRFAM ID: [TIGR00433] biotin synthetase
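
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the gene length counts the stop codon. Both assumptions match the values in this record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(149943, 151049)
print(length)                     # 1107
print(protein_length_aa(length))  # 368
```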


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTTCAAG CCCCCTTATC ACCGTCTCTT CTTCAGACTC AATCCATTTC CCAACCGCCT 
CAAGAAACAG AGGCATTAAA AGCATGGCTT GAGGAATTAA CCCAAAAAAT CATCGAGGGC
GATCGCATCA ATAAATCAGA AGCCCTCACC CTGACCCAAA TTGAAGGTCA AGACTCTATT
CTTTTGCTAT GCGAAGCAGC CGATCGCATC AGACAAGCTT GTTGTGGCAA TGTAGTCGAT
CTGTGTAGCA TTATCAACAT TAAATCCGGC AACTGTTCAG AAAATTGTCG CTTCTGTTCC
CAGTCAGTTT ACCATCCAGG AGAAAATTCC CCCATTTATG GGCTAAAATC CTCAGAGGAA
ATTCTCGCTC AAGCCAAAGC GGCTGAAGCG GCCGGGGCAA AACGCTTTTG TCTGGTCAGT
CAGGGACGAG GACCGAAATA TCAAGGAGCA AAATCCAAGG AATTTGAGCA AATCTTAGCA
ACCGTTCGGC AAATTGCCGC CGAAACCTCT ATTAAACCCT GCTGCGCTCT AGGGGAAGTG
ACCCCAGAAC AAGCCCAGGC TTTAAGAGAA GCCGGAGTTA CCCGCTATAA CCACAATTTA
GAAGCCTCAG AAGGATTTTA TCCCGAAATC GTCACCAGTC ATAGTTGGCG CGATCGCGTG
GAAACCATTA AAAACCTCAA AGCAGCCGGG ATTCAAGCTT GTAGCGGCGG AATCATGGGC
ATGGGAGAAA CTTGGGAAGA TCGGGTGGAT TTAGCCCTCG CTTTGCGGGA ATTAGGCGTA
GAATCGGTTC CGATTAACCT CCTCAACCCC AGAGAAGGAA CCCCATTAGG AGACTGTCAT
CGTCTAGATC CCTTTGAAGC TCTCAAGGCG ATCGCTATTT TTCGCTTGAT TCTCCCTCAA
CAAATCCTGC GCTACGCGGG TGGACGGGAA GCGATTATGG GAGACTTACA AAGTCTAGGG
CTAAAATCGG GAATTAATGC TATGCTGATT GGACATTATC TAACAACTCT AGGACAACCA
CCAGAGAAAG ATCTGGCTAT GGTTGAATCT TTAGGCTTGC AAGGGGGTGA AGCTCCAATT
CCTGGTGAAT ATCAAACGCG ATCGTAA
 
Protein sequence
MVQAPLSPSL LQTQSISQPP QETEALKAWL EELTQKIIEG DRINKSEALT LTQIEGQDSI 
LLLCEAADRI RQACCGNVVD LCSIINIKSG NCSENCRFCS QSVYHPGENS PIYGLKSSEE
ILAQAKAAEA AGAKRFCLVS QGRGPKYQGA KSKEFEQILA TVRQIAAETS IKPCCALGEV
TPEQAQALRE AGVTRYNHNL EASEGFYPEI VTSHSWRDRV ETIKNLKAAG IQACSGGIMG
MGETWEDRVD LALALRELGV ESVPINLLNP REGTPLGDCH RLDPFEALKA IAIFRLILPQ
QILRYAGGRE AIMGDLQSLG LKSGINAMLI GHYLTTLGQP PEKDLAMVES LGLQGGEAPI
PGEYQTRS
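
The gene sequence begins with GTG, yet the protein begins with M. This is consistent with translation table 11, where GTG is one of several alternative initiation codons and an initiator codon is read as methionine. The following is a minimal sketch of such a translation, not the tool used to produce this record. The function name `translate_cds` is hypothetical, and the codon table and start codon set are written out here on the assumption that they match NCBI translation table 11. The input is expected to be a complete coding sequence in frame.

```python
from itertools import product

# Codons enumerated in T, C, A, G order at each position; the amino acid
# string is assumed to match NCBI translation table 11 (same residues as
# the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons assumed for table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            # An initiator codon is read as methionine regardless of its usual meaning.
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First four codons of the gene sequence in this record.
print(translate_cds("GTGGTTCAAG CC"))                    # MVQA
# Last nine codons of the gene sequence, ending in the TAA stop codon.
print(translate_cds("CCTGGTGAAT ATCAAACGCG ATCGTAA"))    # PGEYQTRS
```

The two fragments reproduce the first four and last eight residues of the protein sequence listed above.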