Gene Ava_3398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3398 
Symbol 
ID3680059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4226008 
End bp4227243 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content43% 
IMG OID637718748 
Productgeneral substrate transporter 
Protein accessionYP_323900 
Protein GI75909604 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0177815 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAAGCTT TTAATACATT TGACGCAAGC TTGCGGCTTA ACCTGCTGAT TCTATTTACG 
GCAGGTTTAT TGTTCTGGTC GAGTACTGCT ACTTTCTTGC CCACTCTGCC CTTATATATT
GAGGATGTGG GAGGAAGCAA GCAAGAAATT GGCATTGTGA TGGGTGGTTT TGCTATTGGG
TTGTTAGTAT TTCGCCCAAT GCTGGGACGA ATGGCGGATC AAAACGGTCG GAAGTTACTG
TTATTAATTG GGACAATAGT GGCAACAATT GCCCCCTTTG GCTATTTGGC ATTTAAATCA
ATTCCTTTAT TGATGCTGGT GCGCGTCTTT CATGGCATTA GCATTGCTGC TTTTACCACT
GGTTACAGTG CTTTAATAGC AGATTTAGCC CCTATAGCCA TTCGTGGTGA AATCATCAGT
TACATGAGTC TCACTGCTCC CATTGGCTTG GCAATTGGCC CGGCTTTAGG GGGTTATCTA
CAAGCTTCAA TTGGTTATCC AATTTTATTT TTAATAGCAT CCGAATTGGC TTTTGTGGGG
TTATTGGGAA CGATTCAAGT TTCTAATCCA CCTGTACCAC AAGGTCGCCA AGCAACAGAA
AAGGATAGTA ATTTCTGGCA ACTTTTAAGT AGCCCACGGG TGAGAGTGCC AACTTTGGTG
ATGTTGCTCA TTGGTATAGC TATCGGTGCT GTGCATATTT TTTTACCACT GTTTATTAAA
TCAACAGGGG TGGAATTTAA CGCCGGACTA TTTTTTACGA TCGCGGCCAT TGGTAGTTTC
AGTTTACGGG TATTTGCAGG GAAAGCTAGC GATCGCTTCG GTCGGGGTTT GTTTATTACT
TTCGGTATCA TGGCTTATAT GTTGTCATCT TTCTTGTTAT GGCAAGCCAA CAGTGCCATT
AGTTTCGCTA TTGCAGCGAT CGCTGAAGGT TGTGGCGGCG GAACAATGAT TTCGATGATT
ACGACGATGA TGGCAGACCG CTCGCTACCA CAAGAGCGAG GACGAATTTT CTCTATTTGT
ATCGCTGGAT TGGATTTAGG AATTGCGATC GCTGCCCCTA TTTTAGGTTT TATTGCTGAA
GCGACTGGCT ATCGCAGTAT GTTTGCCTAT ACAACTGCTT TAACTTTCCT AGCCTTACTA
ATTTTCCTGA CCAGATCGAG TAAAAATTTG AGCAATTCCC TGCGGTTTGC TCTGGGTCGC
GGTCAAGATG TCTATTCTCT GCATAATAGT AACTAG
 
Protein sequence
MKAFNTFDAS LRLNLLILFT AGLLFWSSTA TFLPTLPLYI EDVGGSKQEI GIVMGGFAIG 
LLVFRPMLGR MADQNGRKLL LLIGTIVATI APFGYLAFKS IPLLMLVRVF HGISIAAFTT
GYSALIADLA PIAIRGEIIS YMSLTAPIGL AIGPALGGYL QASIGYPILF LIASELAFVG
LLGTIQVSNP PVPQGRQATE KDSNFWQLLS SPRVRVPTLV MLLIGIAIGA VHIFLPLFIK
STGVEFNAGL FFTIAAIGSF SLRVFAGKAS DRFGRGLFIT FGIMAYMLSS FLLWQANSAI
SFAIAAIAEG CGGGTMISMI TTMMADRSLP QERGRIFSIC IAGLDLGIAI AAPILGFIAE
ATGYRSMFAY TTALTFLALL IFLTRSSKNL SNSLRFALGR GQDVYSLHNS N