Gene Ava_4102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4102 
Symbol 
ID3681567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5104259 
End bp5105575 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content45% 
IMG OID637719450 
Productmajor facilitator transporter 
Protein accessionYP_324598 
Protein GI75910302 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.277859 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.56542 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAAC ATCCAAAAAA TACAGGAATG CAAACTTTTA CCATCATCTG GTTTGGACAG 
ATGATTTCCC TCATCGGCTC GCAGCTAACT AATTTTGCCT TGGGTGTATG GGTATACCAG
CGAACTGGCT CAGTCACACA GTTCGCCTTG ATTTCCCTCT TTACCAGCTT GCCCATGATT
CTGATTTCTC CCGTAGCTGG CACGCTGGTA GACCAATTCC CCCGTCGTTG GATGATGTTA
TTTAGTGACT TAGGAGCAGG TATTTCTACG GGGGTAATTG CAATTTTGTT AGCTACAGGC
GATTTGGCTA CTTGGCATAT ATACGTGGGT GCTGCTATTA GTTCCTGCTT TGGTGCTTTT
CAATGGCCAG CTTATACAGC AGCTACTACC TTGCTTGTCC CACCAGAAAA ACTGGCACGA
GCTAACGGTA TGTTGCAAGT GGGGGAAGCC GCAGGTCGGT TAGTTGCACC AATGTTAGGA
GGTATACTGC TGCTGTTTCT GGAAATTGAT GGCATTATCT TTATTGACTT TGCGACATTT
CTGTTTGCTT TGAGTACTCT GTTACTAGCT CCGTTTCCCA AGCAGTACAT TGATAGACAT
CGCGCGGAAA AAACTCCTTG GTTGAAGGAA GCATCTTCCG GTTTGGTCTA TCTAGTTAAC
AGAAGAGGAC TGTTTGCACT ACTACTGTTC TTTGCTGTGA ACAATTTCCT AGTGGGAATT
GTGCAGATGC TAATTACGCC GCTAGTATTG TCCTTTGGTT CGGCTACAGA CTTGGGGACA
ATTATGACTA CTGGCGGTAT CGGAATGCTA GTAAGCAGCA TCCTTGTCAG TACCGTGAGA
ATGCCACAGT ATTTAGCTCT CAGTATCTTT ACTTTTATGC TGCTAGGTGG GATCTGTATT
ACCTGTGCAG GGTTTTACCA ATCGATTTTA GCCTTAGCGC TGATAGCTTT CCTGTTTTTC
TTTGGTCTAC CAATTATTAA CAGTTCAGCC CAAGTTATTT TTCAAAAGAA AGTACCATCT
AGTCTGCAAG GTCGAGTTTT TGCGACAATA GGAGCGATCG CTAACGCATC ACAGCCTTTG
GCTTACACTG TCGCTGGGCC ATTAGCGGAT AAAATCTTCG AGCCGTTAAT GGCTCAGAAT
GGGCTGTTAG CAGAAAGTAT GGGAAAAATT ATTGGTGTTG GTCAAGGACG TGGTATCGGT
CTGATGTTTA TCGTGATGGG AATACTCACC GTATTGGCGA CGATTATCGC CTATCAGTAT
AAACCATTGA GACTTGTGGA AAGGCAACTG CCCGATGCCA TGAATCCCAG TTGCTAG
 
Protein sequence
MTQHPKNTGM QTFTIIWFGQ MISLIGSQLT NFALGVWVYQ RTGSVTQFAL ISLFTSLPMI 
LISPVAGTLV DQFPRRWMML FSDLGAGIST GVIAILLATG DLATWHIYVG AAISSCFGAF
QWPAYTAATT LLVPPEKLAR ANGMLQVGEA AGRLVAPMLG GILLLFLEID GIIFIDFATF
LFALSTLLLA PFPKQYIDRH RAEKTPWLKE ASSGLVYLVN RRGLFALLLF FAVNNFLVGI
VQMLITPLVL SFGSATDLGT IMTTGGIGML VSSILVSTVR MPQYLALSIF TFMLLGGICI
TCAGFYQSIL ALALIAFLFF FGLPIINSSA QVIFQKKVPS SLQGRVFATI GAIANASQPL
AYTVAGPLAD KIFEPLMAQN GLLAESMGKI IGVGQGRGIG LMFIVMGILT VLATIIAYQY
KPLRLVERQL PDAMNPSC