Gene Ava_3042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3042 
Symbol 
ID3681160 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3765632 
End bp3766852 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content43% 
IMG OID637718388 
Productextracellular solute-binding protein 
Protein accessionYP_323547 
Protein GI75909251 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0658677 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGATCG CGATCGCCAT TGTTGCTTAC CATAGTTGGC AAATACAGAG TTCGCCTTCA 
GCATCTGCCC CAGTCACTGT TAAACTCAGT GGCTGGGGTG GTTCTCCGGT TGAGCAAAAA
CTATTGAAAC AGGTTTTACA AGACTTTGAA GCACAGCATC CTCTGATCAA GGTCAAATAC
GAGGTGATTT CTGACCAATA CATGGACGTG ATCAAAACCC GCTTGATTGG AGAAGCTGCC
CCTGATGTCT TCTACTTGGA TGCTCTGGAG GCTCCTTTTT TGATGAGCCA GAATGTGCTG
GAACCATTAG AAAGTTACAT CACTCCCGCA TTTGACTTAA GTGACTTTGA AACCAATCTC
CTAGATAGCT TTAAATACCA GAACCATATT TACGGTTTCC CTAAGGACTA TTCCACTTTG
GCGCTGTTTT ATAACAAAAA AGCCTTTGCT GCTGCTGGTT TGAGCAGCCC CCCTGCGACA
TGGCAAGAAT TACGCACCTA CTCCCGAAAA TTAACAGGTA AACTCAACAA GTATGGCTTT
GGGGAAGCAC CGGAATTAGC TCGCCAAGTT TACAAAATTA AAGCTTTTGG TGGAGAAGTT
ATCAATCAAA ATGGTCATGC TACCTTTGCG AGTGAAGCTG GTTTACGGGG ATTGCAATTA
GTGATAGACC AGTATCAAAA AGATAAATCA TCTGCTCAAA AATCTGACGT AGGGACAAAC
TCAGGTAGCG AAATGTTTGG TCAGGAGAAA GTGGCAATGG TAATTGAAGG TAATTGGGCA
ATTCCATACT TACAAGAAAC CTTTCCGCAA TTGGAGTTTG CAACTGCACA ATTACCAACG
ATTAATCAAA AAAAAGGCAC AATGGTATTC ACTGTTGCCT ATGTCATGAG TAAGCAATCA
CAGCATAAAG CTGAAGCATG GGAGTTAATT TCCTATCTCA CAGGTAAAGC CGGAATGCAG
AAATGGACAA GTACAGGCTT TGCTTTACCT ACACGCAAAT CAGTATCCCA AAAACTAGGA
TATGAGCAAG ATCCCTTGCG ATCGCCTTTA GTTGCCGGTG TTGATGATGC TACACCCTGG
CAGGTTGGTA AATACCCAGC CCCAATTGTG AACAATTTTG ATAATCAATT TGTCAGTGCC
TTACTAGGAC AACAACCATT AAAACAGGCG ATGCTCAGGG CGCAGAATCA GGCAAATAAG
CAAATTCAAG CAATGGAGTG A
 
Protein sequence
MAIAIAIVAY HSWQIQSSPS ASAPVTVKLS GWGGSPVEQK LLKQVLQDFE AQHPLIKVKY 
EVISDQYMDV IKTRLIGEAA PDVFYLDALE APFLMSQNVL EPLESYITPA FDLSDFETNL
LDSFKYQNHI YGFPKDYSTL ALFYNKKAFA AAGLSSPPAT WQELRTYSRK LTGKLNKYGF
GEAPELARQV YKIKAFGGEV INQNGHATFA SEAGLRGLQL VIDQYQKDKS SAQKSDVGTN
SGSEMFGQEK VAMVIEGNWA IPYLQETFPQ LEFATAQLPT INQKKGTMVF TVAYVMSKQS
QHKAEAWELI SYLTGKAGMQ KWTSTGFALP TRKSVSQKLG YEQDPLRSPL VAGVDDATPW
QVGKYPAPIV NNFDNQFVSA LLGQQPLKQA MLRAQNQANK QIQAME