Gene A9601_00481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_00481 
SymboldadA 
ID4716730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp51387 
End bp52496 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content32% 
IMG OID640077745 
Productputative thiamine biosynthesis oxidoreductase 
Protein accessionYP_001008443 
Protein GI123967585 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR02352] glycine oxidase ThiO 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.254103 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACAAG AAACCAAAAA TTCAATATTA ATCATTGGAG GTGGACTTTT AGGTTTATCT 
ATTGCTTATG AATTTTCAAG AAATAGCTTC AAAGTTTTAG TTTTAAGCAA AAACAGAAAT
GAATCAGCTG GATTTGTTGC TGCAGGAATG TTAGCTACTC ATGCTGAAGG GCTCGAAGAT
GAATTACTAA AATTTGGCCA AGAAAGTCAA AATCTAATTC CAAAGTGGAT ACAAAGTATT
GAACAAGATA GTAATATTAA ATGCGGTTTA AAAAAATGTG GCATTGTAGT TCCTTTTAAA
AACAAAGAAG ATCTTGAAGC GTTTCCCACT TATGAATATG GAAAATATTT AAATCACAAA
GATCTTCAAA CAGAAATCAA TGGAATGAAT TCTATTTGGA AACATGGTTT ACTTTTTGAA
CAAGATGGTC AAATAGATAA CCGAAGAAAA CTGATGCGTG CTCTTGAGAG AGCATGCTCC
TTGAATGGAG TCGAATTTCA AGAAGGATCA GAAGTAGAGG ATTTAACATT CGAAAAAAAC
AAAATTACAG GTGCAACAGT TTTATGTGCC ACTGGGGAAA TAAAAAAAAT TAACTGCGAA
AAAGCAATTA TATGCAGCGG TGCGTGGAGT AAAAAAATTT TTAAAAAAAT TCCAGTCTTT
CCTGTAAAGG GACAAATGCT ATCAATACAA GGTCCAACAA ATTTTTTAAA AAGGGTTATT
TTTGGTCCAA AAACTTATCT AGTACCCCGT GATGATGGAC TTATTATAGT TGGAGCGACA
GTTGAAAAAG ATTCAAAATT TAATCAAGGT AATACTCCTA ATGGAATAAA ACAACTGCAA
GAAGGCATTC GCTCTTTATT GCCAGAAGCT ATTAATTGGC CACAAATGGA ACATTGGTGG
GGCTTTAGAC CTTGCACACC AGATCTAAAA CCAATAATTG GAAAATCAAA AATTGAAAAT
CTTTATATAG CTACAGGACA TTACAGAAAT GGAGTTTTAT TTTCTGCAAT AACAAGTGAT
CTTCTTTTGA AAATAGTTCA AAATAAAAAT CTCAAAGAAA TAGAGAAAAG CTTTTTAGAA
AAATTTAGTT TAGATAGATT TGCGATTTAA
 
Protein sequence
MAQETKNSIL IIGGGLLGLS IAYEFSRNSF KVLVLSKNRN ESAGFVAAGM LATHAEGLED 
ELLKFGQESQ NLIPKWIQSI EQDSNIKCGL KKCGIVVPFK NKEDLEAFPT YEYGKYLNHK
DLQTEINGMN SIWKHGLLFE QDGQIDNRRK LMRALERACS LNGVEFQEGS EVEDLTFEKN
KITGATVLCA TGEIKKINCE KAIICSGAWS KKIFKKIPVF PVKGQMLSIQ GPTNFLKRVI
FGPKTYLVPR DDGLIIVGAT VEKDSKFNQG NTPNGIKQLQ EGIRSLLPEA INWPQMEHWW
GFRPCTPDLK PIIGKSKIEN LYIATGHYRN GVLFSAITSD LLLKIVQNKN LKEIEKSFLE
KFSLDRFAI