Gene Ava_0465 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0465 
Symbol 
ID3682431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp590811 
End bp592061 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content37% 
IMG OID637715794 
Productextracellular ligand-binding receptor 
Protein accessionYP_320986 
Protein GI75906690 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0898027 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.196528 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAA CTTTTGCGTA TACCACAGCA TTATTCTCTG CTTGTACTTT TTTTCTGGCA 
GCTTGTAGTG GTACTAACAC AGACACAAAT TCGACCAACA ATTCTCCAAA TAACACAACT
AATACTACAA CAAACGTTAC TACAACAAGC GATAAAAATA CTATTCCTAT TGGCATCGCC
CTAGCACAAA CTAGTAATGT AGCTCTACTT GGTCAAGAAC AAGTCGCTGG AGCTAAAATT
GCTGAAAAGT ATTTCAATGA TAAAGGTGGA GTCAACGGTA CTCCAATTAA ACTAATTTTT
CAAGATACGG CTGGTGACGA AGCTGGAACA ATTAATGCTT TTCAAACTCT TATCAATAAA
GATAAAGTCG TAGGTATTGT AGGTCCTACA TTATCCCAAC AAGCCTTTAG TGCTAATCCC
ATTGCTGAAA GAGCAAAAGT TCCAGTGGTT GGACCATCAA ATACAGCAAA AGGTATACCA
GAAATTGGTG ATTATGTGGC GCGCGTTTCT GCACCCGTTT CTGTAGTTGC CCCTAATTCA
GTCAAAGCTG CACTTAAGCA AAATCCCAAC ATTAAAAAAG TCGCGGTTTT CTTTGCTCAA
AATGATGCCT TTAGCAAATC AGAAACAGAG ATTTTTCAAC AAACTGTTAA GGATCAAGGA
CTAGAATTAG TAACAGTACA AAAGTTCCAA ACTACTGATA CTGACTTTCA ATCTCAGGCG
ACTAATGCAA TTAATTTAAA ACCTGATTTA GTGATTATTT CTGGACTAGC AGCCGATGGT
GGAAACTTAG TTAGACAGTT GCGTGAACTT GGTTATCAAG GTGCGATTAT TGGCGGTAAT
GGTTTAAATA CATCAAATGT TTTCGCAGTT TGTAAAGCAC TTTGTGACGG TGTATTAATA
GCTCAAGCTT ACAGTCCAGA ATATACTGGT GAAATTAATA AGGCATTTCG TCAAGCCTAC
GTTGATCAAT ATAAGAAAGA ACCGCCCCAA TTTAGCGCTC AAGCCTTTGC CGCAGTACAA
GTATATGTAG AATCTCTCAA AGCATTAGAT ACCAAGAATA AAGTTAGTAA AATACAGTTA
CCGGAATTAC GTACAGAACT GAACAAACAG CTACTAACAG GTAAATACAA TACACCATTA
GGAGAAATTA GTTTTACACC AATCGGTGAA GTTGTACAAA AAGATTTTTA TGTAGCTCAA
ATTAAAATGG AAAAGGATGG TAGTCAAGGT AAGTTTACAT TCTTGAAATA G
 
Protein sequence
MKITFAYTTA LFSACTFFLA ACSGTNTDTN STNNSPNNTT NTTTNVTTTS DKNTIPIGIA 
LAQTSNVALL GQEQVAGAKI AEKYFNDKGG VNGTPIKLIF QDTAGDEAGT INAFQTLINK
DKVVGIVGPT LSQQAFSANP IAERAKVPVV GPSNTAKGIP EIGDYVARVS APVSVVAPNS
VKAALKQNPN IKKVAVFFAQ NDAFSKSETE IFQQTVKDQG LELVTVQKFQ TTDTDFQSQA
TNAINLKPDL VIISGLAADG GNLVRQLREL GYQGAIIGGN GLNTSNVFAV CKALCDGVLI
AQAYSPEYTG EINKAFRQAY VDQYKKEPPQ FSAQAFAAVQ VYVESLKALD TKNKVSKIQL
PELRTELNKQ LLTGKYNTPL GEISFTPIGE VVQKDFYVAQ IKMEKDGSQG KFTFLK