Gene Ava_C0102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_C0102 
Symbol 
ID3678057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007412 
Strand
Start bp124255 
End bp125811 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content47% 
IMG OID637715186 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_320380 
Protein GI75812763 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACTA CATACCATCA CAGTGCTGCT TTTGATTATA TTGTGATTGG CGCCGGTTCA 
GCAGGCTGCG TTGTTGCCAA CCGTCTTACA GAAGACCCTA ATACTAAAGT ATTGCTGCTC
GAAGCGGGTG ATCCTGATAC CAAGCCAGAA CTTCAAGTTC CCTCATTGTG GCCTACTACA
CTCTTAGGCT CGGAAGTGGA CTGGGCATAC TTAACTGAAG GGGAACCTTA CTTAAATAAC
CGCAAAATTT TATCTTCACG CGGTAAAGTC TTGGGCGGCA GCAGTTCGAT TAATGGCATG
ATTTATATAC GAGGCAATGA ACGTGACTAC AATAGCTGGC AAGCGTTAGG TAATATTGGT
TGGAGTTATC AGGATGTCTT GCCTTACTTC AAGAAATCGG AAAACCAGCA GCGGGGAGCA
TCGTTATTTC ACGGGGTTGA TGGACCACTT AGTATCACAG ATCCACTTTC TCCTGCAAAA
GTGTCGCAAC GCTTTGTGGA AGCCGCGATC GCACAGGGCT ATGAGCAAAA TCCCGACTTT
AATGGCGTAC AGCAGGAAGG TGCAGGACTT TACCAAGTGA CCGTGAAAGA TGGCAAGCGC
CAAAGTACAG CAGTGGCATT TCTGCGTCCG ATTAAAGATC GCCCCAACTT GACCATTCAA
ACAGGAGCAT TGGTGACTCG TTTACTCTTT GAGGGAAAGC GTGCAGTAGG GGTAGTGTAT
GTTCAAAATG GAACGGAGTA TCAAATCAGG GTCAACTCCG AAGTGATTTT GAGTGCTGGC
GCCTTCGATT CTCCTAAACT GCTCATGCTT TCTGGAATTG GACCTGCTGA ACATCTGCGG
GCAGTAGGCA TTCCTGTAGT TTTTGATTTG CCGGGTGTCG GCCAGAATCT TCAAGATCAC
CCACTTGCTG TTATTGCCTA CCAGTCTACT CAGGACGTAC CCCTTGCGCC AAGTAGTAAT
GGGGGAGAGG CTGGGTTATT TCTGCATACC AACAATAATT TAGATGCGGC ACCTAATTTG
CAATTTACAA TTGTTCCGAT TTTATATGTC GATCCTGCCT ATGCACGTGA AGGTCCGGGA
TTCACCCTTA CCTTTTACAT CACCCGTCCC GAAAGTCGTG GTAGTGTAAG ACTACGTTCC
TCCTCCCCCT TCGACCCACC GTTGATTCGC GTCAACTATC TTCAGAAAGA ATCTGACATG
CAACTGATGG TTGAAGGACT TAAAATTTTG CGTCAAATTG TGTACTCCGA TGCGTTTAAT
GAGTTTCGGG GTGAGGAAAT TGCTCCAGGG AGTTCCGTGC ATAGCGACAA AGCAATCGAA
GATTATATTC GGCAAACGTG CGGTACGGGA TGGCATCCTG TTGGGACGTG CAAAATGGGT
ATTGATCAAA TGGCGGTTGT CGATCCTCAA CTCAAGGTAC GGGGGATTGA AGGGTTACGA
GTTGTTGATG CATCGATTAT GCCAACTATG ATCACAGGAA ACACAAATGC ATCGGCAATT
ATGATTGGAG AAAAGGCTGC CGATTTGATA AAAGTTGGAA CAAAATTGCC TCAATGA
 
Protein sequence
MTTTYHHSAA FDYIVIGAGS AGCVVANRLT EDPNTKVLLL EAGDPDTKPE LQVPSLWPTT 
LLGSEVDWAY LTEGEPYLNN RKILSSRGKV LGGSSSINGM IYIRGNERDY NSWQALGNIG
WSYQDVLPYF KKSENQQRGA SLFHGVDGPL SITDPLSPAK VSQRFVEAAI AQGYEQNPDF
NGVQQEGAGL YQVTVKDGKR QSTAVAFLRP IKDRPNLTIQ TGALVTRLLF EGKRAVGVVY
VQNGTEYQIR VNSEVILSAG AFDSPKLLML SGIGPAEHLR AVGIPVVFDL PGVGQNLQDH
PLAVIAYQST QDVPLAPSSN GGEAGLFLHT NNNLDAAPNL QFTIVPILYV DPAYAREGPG
FTLTFYITRP ESRGSVRLRS SSPFDPPLIR VNYLQKESDM QLMVEGLKIL RQIVYSDAFN
EFRGEEIAPG SSVHSDKAIE DYIRQTCGTG WHPVGTCKMG IDQMAVVDPQ LKVRGIEGLR
VVDASIMPTM ITGNTNASAI MIGEKAADLI KVGTKLPQ