Gene Ava_4469 details

Gene Information

Locus tag: Ava_4469
Symbol:
ID: 3680325
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5595463
End bp: 5597133
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 47%
IMG OID: 637719824
Product: ribulose bisphosphate carboxylase, small chain
Protein accession: YP_324962
Protein GI: 75910666
COG category: [R] General function prediction only
COG ID: [COG0663] Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily
TIGRFAM ID:
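
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon. Both assumptions are consistent with the numbers shown but are not stated by the record itself. The function names are illustrative only.

```python
# Values copied from the record; function names are hypothetical helpers.
START_BP = 5595463
END_BP = 5597133


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the terminal stop codon


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, protein_length(n_bp))  # prints "1671 556"
```

Under these assumptions the computed values agree with the Gene Length (1671 bp) and Protein Length (556 aa) fields.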


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.97747
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGTCC GCAGCACGGC GGCACCCCCA ACCCCGTGGT CGAGGAGTTT AGCTGAAGCC 
CAAATCCATG AAAGCGCCTT TGTACATCCG TTTTCTAACA TTATTGGGGA TGTTCATATC
GGTGCAAATG TCATCATTGC TCCAGGGACT TCAATCAGAG CCGATGAAGG TACACCCTTT
CATATTGGTG AAAATACCAA TATTCAAGAC GGTGTAGTGA TTCACGGTTT AGAGCAAGGT
AGAGTCGTTG GTGATGACAA CCAAGAATAC TCCGTTTGGG TGGGTAGCAG CGCTTCCTTG
ACACATATGG CGTTGATTCA TGGCCCTGCT TACGTTGGGG ATAACTCGTT TATTGGTTTT
CGCTCTACGG TATTTAATGC CAAGGTGGGA GCAGGTTGCA TCGTCATGAT GCACGCTTTA
ATTAAGGACG TAGAAGTTCC CCCTGGTAAG TACGTTCCTT CAGGAGCGAT CATCACTAAT
CAAAAGCAGG CCGATCGCTT GCCAGATGTG CAACCTCAAG ACAGGGATTT TGCTCATCAC
GTAATTGGGA TTAATCAAGC ATTGCGGGCT GGATATCTTT GTGCTGCGGA TAGCAAGTGT
ATTGCCCCCC TTCGCAATGA TCAAGTTAAA TCTTATACAA GTAATACAGT TATTGGGTTA
GAAAGGAGTA GTGAAGTGGC AAGCAACAGC TTGGGTGCAG AAACCATAGA GCAGGTACGC
TATTTATTAG AGCAAGGCTA TAAGATTGGG ACAGAACACG TAGATCAAAG AAGATTTCGT
ACAGGTTCTT GGACTAGTTG CCAGCCAATT GAAGGTAGAT CCGTAGGAGA TGCCTTAGCA
GCTTTAGAAG CTTGTTTAGC TGACCATAGT GGTGAGTATG TACGTTTATT CGGCATTGAC
CCCAAAGGTA AACGGCGAGT TTTAGAAACA ATTATCCAAC GTCCCGATGG TGTGGTGGCA
GGTTCTACCA GCTTCAAAGC GCCTGCTAGT AACACCAATG GCAATGGTAG CTACCACAGC
AACGGCAATG GTAACGGTTA TAGTAACGGT GCAGCCAGTG GTAAAGTCAG TGCTGAAACC
GTAGACCAAA TTCGCCAGTT ATTGGCTGGT GGTTACAAAA TTGGCACAGA ACACGTAGAT
GAGCGTCGCT TCCGTACAGG TTCCTGGAAT AGCTGTAAGC CAATTGAAGC AAACTCCCCA
GGTGAAGTAG TGGCGGCTTT AGAAGAATGT ATCGACAGTC ATCAAGGTGA GTACATCCGC
CTCATCGGCA TTGACCCGAA AGCCAAACGG CGTGTATTGG AAAGTATTAT CCAACGTCCC
AACGGTCAAG TAGCTCCATC GACGAGTAGC CCCAGAACCG TCGTGAGTGC CTCATCTGCT
TCATCCGGAA CAGCTACCGC AACAGCTACC CGCTTAAGTA CAGAAGTAGT AGATCAGGTG
CGGCAAATAC TGGGTGGTGG GTATAAACTC AGCATTGAAC ACGTAGATCA AAGAAGATTC
CGTACTGGTT CTTGGACTAG TACCGGGGCA ATTTCCGCTA CTTCCGAAAG AGAAGCGATC
GCAGTCATAG AAGCCTCCTT ATCCGAATTT GCTGGAGAAT ATGTGCGCTT GATTGGTATC
GACCCCAAAG CCAAGAGGCG AGTGTTGGAA ACAATCATTC AGCGTCCATA G
 
Protein sequence
MAVRSTAAPP TPWSRSLAEA QIHESAFVHP FSNIIGDVHI GANVIIAPGT SIRADEGTPF 
HIGENTNIQD GVVIHGLEQG RVVGDDNQEY SVWVGSSASL THMALIHGPA YVGDNSFIGF
RSTVFNAKVG AGCIVMMHAL IKDVEVPPGK YVPSGAIITN QKQADRLPDV QPQDRDFAHH
VIGINQALRA GYLCAADSKC IAPLRNDQVK SYTSNTVIGL ERSSEVASNS LGAETIEQVR
YLLEQGYKIG TEHVDQRRFR TGSWTSCQPI EGRSVGDALA ALEACLADHS GEYVRLFGID
PKGKRRVLET IIQRPDGVVA GSTSFKAPAS NTNGNGSYHS NGNGNGYSNG AASGKVSAET
VDQIRQLLAG GYKIGTEHVD ERRFRTGSWN SCKPIEANSP GEVVAALEEC IDSHQGEYIR
LIGIDPKAKR RVLESIIQRP NGQVAPSTSS PRTVVSASSA SSGTATATAT RLSTEVVDQV
RQILGGGYKL SIEHVDQRRF RTGSWTSTGA ISATSEREAI AVIEASLSEF AGEYVRLIGI
DPKAKRRVLE TIIQRP
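
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own pipeline. It uses the standard codon assignments, which translation table 11 shares (table 11 differs mainly in which codons may act as start codons). The names `translate`, `gc_percent`, and `clean` are hypothetical helpers. It assumes the sequence is pasted as it appears here, in whitespace-separated blocks, and contains only A, C, G, and T.

```python
# Codon table built from the usual NCBI ordering of bases (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Limitation: a non-ATG start codon is not converted to M here.
    """
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    first_row = "ATGGCAGTCC GCAGCACGGC GGCACCCCCA"
    print(translate(first_row))  # prints "MAVRSTAAPP"
```

Applied to the first 30 bases of the gene sequence, the sketch returns the first ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.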