Gene Ava_C0126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_C0126 
Symbol 
ID3678062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007412 
Strand
Start bp154229 
End bp156520 
Gene Length2292 bp 
Protein Length763 aa 
Translation table11 
GC content58% 
IMG OID637715209 
Productxanthine dehydrogenase, molybdenum binding subunit apoprotein 
Protein accessionYP_320403 
Protein GI75812786 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0683441 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGAT ACATCGGAAA AGAAATGAGC CGCGTAGACG GCATCGCCAA AGTGACGGGC 
AAGGCAAAAT ACGCCGCAGA ATTTCAGGTT TCCAATCTCG CTTACGGTTT TATCGTGCTG
GGAACCGTAG CAAAGGGCAC GATTAAATCA ATCGACACGG GTGAGGCTGA ACGCGCTGCG
GGCGTAATTC GCGTCTTCAC GCACCTGAAC ACACCAAAGC TCGATCCGAA GTTTTCTATG
GTGCAGGCTC CGTCGCGCGC TACCGATGAG CAAGACAAGT CGTTTCGGGC GCTCCAGTCC
GACAAAATCT TCTTCAATAT GCAGCCTGTG GCACTCGTCG TTGCTGAGAC ATACGAGCAG
GCGCGTTATG CGGCACGTCT GGTCAAAGTC TCCTACAACA CAGAGCCGCA CACGACCGAC
ACCGAGGCGG TGCGAGCGAT CGCGCGCGTC CCCACCCAAG CGCCGCCTCT AAAGCCGCGC
GGTAATCCCG AAGAGACGAT GCGTACAGCG GCGATTGAGA CTCAGCCGTT AGTTGTTGAA
CATACTTTAA CGCGCGGTAA TCCGGAAGAG ACGATGCGTA CAGCGGCTGT GAAGGTAGAG
GCCGAGTACC GCATTCCTAT CGAGCACCAC AACCCCATAG AGCCGCACGC GGCCATCGCG
GTCTGGCAAG GCGACAAGCT CACGATCTTC GACAAGACGC AGGGGGTCTA TGGCGTGCGT
GCACATTTGG CTTCGAGCTT CGGCGTGCCT GAGGAAAACG TGAGCGTGCT TTCACCCTTC
GTCGGCGGGG CTTTCGGATC GTCGCTGCGC CCGAACTATT ATCCGGCGCT GACAGCGATG
GCGGCCCGGG AACTCAAACG CCCAGTCAAG GTCGTTTATA CGCGCACACA AATGTTCACT
GGCCACGGCT ATCGTCCGTA CACGATTCAG AAGGTCGCGC TCGGCGCAGA GCGATCGGGA
AAGCTCTCCA CGATGATTCA TGAAGTCGTA CACAACACCT CCAACATCGA GGAGTTCTCA
GACGAAACCA CACTTTTCAC GCGCCAGGTC TACGCCTGCC CGAACCTCTA CGCGCCGCTG
AAGATTACCA ACACTGACCT CCCCACCCCG ACCTGGATGC GTGCACCGGG GGCCGTTAGC
GGTATGTTTG CGCTCGAATG CGCGATGGAC GAGCTGGCCT ACGCGCTCAA GATCGACCCG
CTCGAACTAC GGCTGATCAA CTACGCCGAA GTAGACCCCG AGAGCGGCAA GCCATTTTCG
AGTAAGGCGT TGCGGGAGTG CTATCGGCTC GGCGCGGAAA AATTCGGTTG GAAGAAGCGC
CAGTTTGAGC CGCGCTCAAT GCGCGACGGA CGCCTACTCG TCGGTTGGGG TATGGCAACC
GGCGTCTGGG GCGCTTTCCA GATGCCTGCT GCCGCCCTCA TCACGCTGCG GGTGGATGGC
ACGGCGCAAG TTGCTAGTGC GACCAGTGAC ATCGGCCCCG GCACCTACAC CGTGATGACC
ATGATCGCTG CTGAGTATCT GGGGCTGAAG TTGGAGCAGG TGAAATTTGA ACTCGGCGAC
ACGAAGTTCC CTCGCTCACC GTCGCAGGGG GGATCTTGGA CGACGGCGAG CGTCGGCTCG
GCCGTGCGCG GCGCGGCTCT GGCCATCGGC ACGAAGTTGC TCGCACTCGC AAACCAGGAA
CCGAATTCTC CGCTCAAGGG GGCAGCCGCC GCCGATGTCG AGATGCTCGA CGGGAGACTG
CGACTAAAAA GCGACCCGTC GCGTTTCGTC AATATCTCTG GAGTGATGAA GCGTAATGGT
CTCACCGCAA TCACGGAAAC ATTCGAGTCG CGTCCCTCAG AGGAGCGCGA GAAGTATGCG
ACGTTGGCGC ACGGCGCGCA GTTCATTGAA GTGAAGGTCG ATCCAGATGT GGGGACTGTC
CACGTTACGC GGGCCATTGA AGTGACTGCC AGCGGCAAGA TCATGAATCC GAAGGCCTCG
CACAGCCAGG AATTTGGCGG CGTTGTCTGG GGCATCGGCA TGGCACTGCA AGAGGCGACT
GAAATCGACC ATCGTTACGG GCGGATCATG AACCCGAATT TGCAGCACTA CCATGTGCCG
GTTAATGCCG ATATCTTGGA GATCGAAACG ATCTTTGTTG AGGAGGACGA TAAAATCGTC
AATCCGCTCG GTGTTAAAGG CATGGGCGAA CTCGGTATGG TCGGAATTCC GGCAGCGATC
GCCAATGCGG TTTTTCACGC GACCGGCAAG CGAATCAGAG ATTTGCCCAT CACGCCTGAC
AAACTCTTGT AG
 
Protein sequence
MARYIGKEMS RVDGIAKVTG KAKYAAEFQV SNLAYGFIVL GTVAKGTIKS IDTGEAERAA 
GVIRVFTHLN TPKLDPKFSM VQAPSRATDE QDKSFRALQS DKIFFNMQPV ALVVAETYEQ
ARYAARLVKV SYNTEPHTTD TEAVRAIARV PTQAPPLKPR GNPEETMRTA AIETQPLVVE
HTLTRGNPEE TMRTAAVKVE AEYRIPIEHH NPIEPHAAIA VWQGDKLTIF DKTQGVYGVR
AHLASSFGVP EENVSVLSPF VGGAFGSSLR PNYYPALTAM AARELKRPVK VVYTRTQMFT
GHGYRPYTIQ KVALGAERSG KLSTMIHEVV HNTSNIEEFS DETTLFTRQV YACPNLYAPL
KITNTDLPTP TWMRAPGAVS GMFALECAMD ELAYALKIDP LELRLINYAE VDPESGKPFS
SKALRECYRL GAEKFGWKKR QFEPRSMRDG RLLVGWGMAT GVWGAFQMPA AALITLRVDG
TAQVASATSD IGPGTYTVMT MIAAEYLGLK LEQVKFELGD TKFPRSPSQG GSWTTASVGS
AVRGAALAIG TKLLALANQE PNSPLKGAAA ADVEMLDGRL RLKSDPSRFV NISGVMKRNG
LTAITETFES RPSEEREKYA TLAHGAQFIE VKVDPDVGTV HVTRAIEVTA SGKIMNPKAS
HSQEFGGVVW GIGMALQEAT EIDHRYGRIM NPNLQHYHVP VNADILEIET IFVEEDDKIV
NPLGVKGMGE LGMVGIPAAI ANAVFHATGK RIRDLPITPD KLL