Gene Ava_C0026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_C0026 
Symbol 
ID3677791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007412 
Strand
Start bp43760 
End bp44899 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content49% 
IMG OID637715110 
Productmajor facilitator transporter 
Protein accessionYP_320304 
Protein GI75812687 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.371223 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0147554 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGAC CCCAGTTCAT TGTGGCAGTA GCCATTTTCC TCATCACCCA CGCAACGAAT 
TTGCAAATTC CCCTGTACGG TACTTATGCA AAGATGGCAG GTTTTGGTAG TGGTGTTTCG
GCGATCGTTT TCTCAACCTA TATTGCAGGA TTGGTGCCAA CGCTGCTTCT GCTGGGTGGA
GCCTCTGACC GGATTGGACG CAAAATTGTC ATCCTCACAA GTTTGCTATT AGCTTGTGTC
GCAACGTTTT TAATGATTGT TCAGCCGAAT ATCTACACGC TGTTCGTCAC ACGAGTTCTA
CAAGGAATTA GTGTTGGTTT CATGACTGGA ACAGGAACGG CATACCTATC AGCGCTAATG
CCACAAAATG CCACGAAGGT TGCTGCTTAC GTCAGCTTAA CTACTGCTTT GGGCTTTTCC
AGTGGTGCGC TGTTTACGAA TGCCACTTTG TTTTATCGCT ATTCACTGGT GCCGCTTAGC
TATTGGGTCG TCTTTATTCT CCTTCTAGGC TGTATTAGTT TAGCTATTAG TATTCCAGAA
CAGGCAACGG CTTCAGCAGC GTTGATCCGG CTGCCAAGTT TCTCAATGGG CGCAGTTTGG
GCAGGGTTAG CGATCGCATT GGCATGGTCT TTGGCAGGAA TTGTCGGTGT CATTTTACCT
ACCCAGCTAA CAAGATATGG GCTACCGAAC TGGGCAGGTC TAATGCTATT CATCATTACG
ATCGCAGGGG TTGTGTTTCA ACCTTTTGCC CGTCGGCTAG AGGCACGGCG ATCGCTTCAG
ATCGGTGCTG TCCTGCTGGT AACTGGCTAT TTTAGCTTCA CATGTGGAGC ATGGCTTGGT
CATTTAGGGT TGGTGCTGGC AGGAGTGGCG ATCGCTGGAA CGGCGTGCTA CGGGTTCACC
TACCTGGGCG GATTGGCTGA AGTGGTACAG ATAAGCGGCA CTCAATCTGC GAGATTCACC
TCTGGCTATT TTGTCTGCGC TTATTTGGGG TACGGCATCC CTGTAATTCT GATTGGCTTT
GTGTCTGACA AATTTGGCGT GATGCAGGCG TTATTTGGCT TTGGCGCAGT CTTGCTGGTT
TGCAATGCCC TACTCTTTGT CAGATATCAA CGAATAGCAA AAAACCAGTA TCTTGCTTAG
 
Protein sequence
MKRPQFIVAV AIFLITHATN LQIPLYGTYA KMAGFGSGVS AIVFSTYIAG LVPTLLLLGG 
ASDRIGRKIV ILTSLLLACV ATFLMIVQPN IYTLFVTRVL QGISVGFMTG TGTAYLSALM
PQNATKVAAY VSLTTALGFS SGALFTNATL FYRYSLVPLS YWVVFILLLG CISLAISIPE
QATASAALIR LPSFSMGAVW AGLAIALAWS LAGIVGVILP TQLTRYGLPN WAGLMLFIIT
IAGVVFQPFA RRLEARRSLQ IGAVLLVTGY FSFTCGAWLG HLGLVLAGVA IAGTACYGFT
YLGGLAEVVQ ISGTQSARFT SGYFVCAYLG YGIPVILIGF VSDKFGVMQA LFGFGAVLLV
CNALLFVRYQ RIAKNQYLA