Gene PCC8801_3028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3028 
Symbol 
ID7105436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3158314 
End bp3159351 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content34% 
IMG OID643476055 
ProductTaurine catabolism dioxygenase TauD/TfdA 
Protein accessionYP_002373168 
Protein GI218247797 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2175] Probable taurine catabolism dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTCCA AACCTTTAAA AACCATTAAA CGACGGGCAG TCAATATTGC TGCTTCCCAG 
TTAGTAACTG TTTCTTGTTT TGAGCAAAAA CCGATTCCAA TTATTATTCA GCCTAATCAA
AATAATTTAG ACTTAATTGC TTGGGCAACT TATCATCAAG AAGTAATTAA TAACTATTTA
CAGCAACAAG GTGCTATTCT ATTTCGAGGG TTTAGCATCA ATAAATTGGC ACAGTTTGAA
GAGTTAATGA CAGCTCTTTT TGGTTCCCTT TTAGATTATT CTTATGGTTC AACCCCAAGA
CATAAAGTTA AAGGAAGTAT TTATACTTCA ACGGAATATC CCCCTGAGCA ATTTATTCCC
TTACATAATG AGATGTCTTA TGCTTCAAAT TGGCCAGAGA AAATTGGATT TTTCTGTTTA
AAAGCAGCTA CACAAGGGGG AGAAACACCT ATTGCTAATA GTCGTCGCAT TTTTCAACGG
ATTGATCCTA AAATTAGAGA AAAGTTTCAA GAAAAAGGAA TACTGTATGT GAGAAATTAC
AGTGAACAGT TAGATTTGCC TTGGCAAAAA GTTTTTCAAA CCACTAATAA ATTACAGGTT
GAAAACTATT GTCGTCAATC AGGAATTGAA TGGGAATGGA ATGACAATCA TTTAAAAACT
CGTCAAATTT GTCAAGCAGT TGCTAATCAT CCCCAAACTA ATGAAATGGT ATGGTTTAAT
CAAGCTCATT TATTCCATGT TTCTAGTTTA AATTCATCTT TTAGAGATAG TCTTCTAGAA
GTATTAAAAG AGGAAGATTT ACCCCGTAAT GCTTATTATG GTGATGGTAC TCCTTTAGAA
GTTTCTGTTT TGGAGGAAAT TCGCACAATT TATCAAGAAG AAATGGTGAT ATTTTCTTGG
CAATCAGGAG ATTTATTATT ACTAGATAAT ATGTTAACGG CTCATGGACG AATGCCGTTT
ACCGGAGAGC GACGAGTGGT TGTCGCTATG GCTCAACCCC ATGATTTGGT CGTTAAAACT
TGGACAACCT TAATTTAG
 
Protein sequence
MISKPLKTIK RRAVNIAASQ LVTVSCFEQK PIPIIIQPNQ NNLDLIAWAT YHQEVINNYL 
QQQGAILFRG FSINKLAQFE ELMTALFGSL LDYSYGSTPR HKVKGSIYTS TEYPPEQFIP
LHNEMSYASN WPEKIGFFCL KAATQGGETP IANSRRIFQR IDPKIREKFQ EKGILYVRNY
SEQLDLPWQK VFQTTNKLQV ENYCRQSGIE WEWNDNHLKT RQICQAVANH PQTNEMVWFN
QAHLFHVSSL NSSFRDSLLE VLKEEDLPRN AYYGDGTPLE VSVLEEIRTI YQEEMVIFSW
QSGDLLLLDN MLTAHGRMPF TGERRVVVAM AQPHDLVVKT WTTLI