Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4469 |
Symbol | |
ID | 3680325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 5595463 |
End bp | 5597133 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637719824 |
Product | ribulose bisphosphate carboxylase, small chain |
Protein accession | YP_324962 |
Protein GI | 75910666 |
COG category | [R] General function prediction only |
COG ID | [COG0663] Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.97747 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGTCC GCAGCACGGC GGCACCCCCA ACCCCGTGGT CGAGGAGTTT AGCTGAAGCC CAAATCCATG AAAGCGCCTT TGTACATCCG TTTTCTAACA TTATTGGGGA TGTTCATATC GGTGCAAATG TCATCATTGC TCCAGGGACT TCAATCAGAG CCGATGAAGG TACACCCTTT CATATTGGTG AAAATACCAA TATTCAAGAC GGTGTAGTGA TTCACGGTTT AGAGCAAGGT AGAGTCGTTG GTGATGACAA CCAAGAATAC TCCGTTTGGG TGGGTAGCAG CGCTTCCTTG ACACATATGG CGTTGATTCA TGGCCCTGCT TACGTTGGGG ATAACTCGTT TATTGGTTTT CGCTCTACGG TATTTAATGC CAAGGTGGGA GCAGGTTGCA TCGTCATGAT GCACGCTTTA ATTAAGGACG TAGAAGTTCC CCCTGGTAAG TACGTTCCTT CAGGAGCGAT CATCACTAAT CAAAAGCAGG CCGATCGCTT GCCAGATGTG CAACCTCAAG ACAGGGATTT TGCTCATCAC GTAATTGGGA TTAATCAAGC ATTGCGGGCT GGATATCTTT GTGCTGCGGA TAGCAAGTGT ATTGCCCCCC TTCGCAATGA TCAAGTTAAA TCTTATACAA GTAATACAGT TATTGGGTTA GAAAGGAGTA GTGAAGTGGC AAGCAACAGC TTGGGTGCAG AAACCATAGA GCAGGTACGC TATTTATTAG AGCAAGGCTA TAAGATTGGG ACAGAACACG TAGATCAAAG AAGATTTCGT ACAGGTTCTT GGACTAGTTG CCAGCCAATT GAAGGTAGAT CCGTAGGAGA TGCCTTAGCA GCTTTAGAAG CTTGTTTAGC TGACCATAGT GGTGAGTATG TACGTTTATT CGGCATTGAC CCCAAAGGTA AACGGCGAGT TTTAGAAACA ATTATCCAAC GTCCCGATGG TGTGGTGGCA GGTTCTACCA GCTTCAAAGC GCCTGCTAGT AACACCAATG GCAATGGTAG CTACCACAGC AACGGCAATG GTAACGGTTA TAGTAACGGT GCAGCCAGTG GTAAAGTCAG TGCTGAAACC GTAGACCAAA TTCGCCAGTT ATTGGCTGGT GGTTACAAAA TTGGCACAGA ACACGTAGAT GAGCGTCGCT TCCGTACAGG TTCCTGGAAT AGCTGTAAGC CAATTGAAGC AAACTCCCCA GGTGAAGTAG TGGCGGCTTT AGAAGAATGT ATCGACAGTC ATCAAGGTGA GTACATCCGC CTCATCGGCA TTGACCCGAA AGCCAAACGG CGTGTATTGG AAAGTATTAT CCAACGTCCC AACGGTCAAG TAGCTCCATC GACGAGTAGC CCCAGAACCG TCGTGAGTGC CTCATCTGCT TCATCCGGAA CAGCTACCGC AACAGCTACC CGCTTAAGTA CAGAAGTAGT AGATCAGGTG CGGCAAATAC TGGGTGGTGG GTATAAACTC AGCATTGAAC ACGTAGATCA AAGAAGATTC CGTACTGGTT CTTGGACTAG TACCGGGGCA ATTTCCGCTA CTTCCGAAAG AGAAGCGATC GCAGTCATAG AAGCCTCCTT ATCCGAATTT GCTGGAGAAT ATGTGCGCTT GATTGGTATC GACCCCAAAG CCAAGAGGCG AGTGTTGGAA ACAATCATTC AGCGTCCATA G
|
Protein sequence | MAVRSTAAPP TPWSRSLAEA QIHESAFVHP FSNIIGDVHI GANVIIAPGT SIRADEGTPF HIGENTNIQD GVVIHGLEQG RVVGDDNQEY SVWVGSSASL THMALIHGPA YVGDNSFIGF RSTVFNAKVG AGCIVMMHAL IKDVEVPPGK YVPSGAIITN QKQADRLPDV QPQDRDFAHH VIGINQALRA GYLCAADSKC IAPLRNDQVK SYTSNTVIGL ERSSEVASNS LGAETIEQVR YLLEQGYKIG TEHVDQRRFR TGSWTSCQPI EGRSVGDALA ALEACLADHS GEYVRLFGID PKGKRRVLET IIQRPDGVVA GSTSFKAPAS NTNGNGSYHS NGNGNGYSNG AASGKVSAET VDQIRQLLAG GYKIGTEHVD ERRFRTGSWN SCKPIEANSP GEVVAALEEC IDSHQGEYIR LIGIDPKAKR RVLESIIQRP NGQVAPSTSS PRTVVSASSA SSGTATATAT RLSTEVVDQV RQILGGGYKL SIEHVDQRRF RTGSWTSTGA ISATSEREAI AVIEASLSEF AGEYVRLIGI DPKAKRRVLE TIIQRP
|
| |