Gene Avin_21230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_21230 
SymbolcsbX 
ID7761048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2120116 
End bp2121339 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content67% 
IMG OID643805018 
Productcatecholate siderophore efflux pump, MFS_1 family 
Protein accessionYP_002799299 
Protein GI226944226 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000768503 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCGG ACCAATCCCT GCAATCGGAA GCACGAACGA AACCTTTTGC AGCAATTTCA 
CCCAAGGTGC TCATCGCCCT GATCGCCTCC CTGCAGTTCA CCTACATTCT CGATTTCATG
CTGGTCCTGC CGCTCGGCCC GGATTTGGCC AAAGGTCTCA ACTTTCACGG TAATCAAGTT
GCATGGCTAA CAGCCAGCTA TACATTTGCG TCCTTGCTGT CCGGTCTGTT CACCGTGCCG
CGCCTGGATC GCTTCGATCG CCGCAAGGCC CTGCTCTGGA GCCTGGTGGG ACTGGCGCTC
GCTTCGCTGG CCTGTACCCT GGCGCATGAT TTGCCGAGCC TGCTGCTCGG ACGGGCCGTC
GCCGGGCTCT GTGCCGCCCC GGCGATCGCC ACCGGCATGG CGATCCTGAT CGACCAGACG
CCGCCGCCAC AGCGCGGGAC AGCCATCGCC AAGGTCATGA CCGGGTTTTC CATCGCCACC
ATCGCCGGGA TTCCCCTGGC GCTGGAACTG GCGGACCACT TCGGCTGGCA GGCGCCCTTC
GTGCTGGTGG CTGTGCTGGT GGTGCTGGTC GCCTTCGCCG TGGCCCATCT GTTGCCGCCG
CTGACCGCCC ACCTGCAAGG GCCGGCTGGC AAGCCGTCGC TGGCCATGCT GAGCCGTCCC
GGCGTGCGCC TGGCCACGCT GTTGCAGGGG CTCAACCAGT TCTCCGCCTT CCTGGTGATC
CCGAGCTTCT CCGCGTTCTA CCTGCTGAAC CTCGATTACC CGCGTCAGCA ACTGGGTACG
CTGTACCTGG TGGGCGGTCT GGTGGCGCTG GGGGCCATGC AACTGGCCGG ACGCCTGGGC
GATCGACACG GCCACTGGCT GCCGGTCGGC GTGGCCAGCG CCTGTTTCGC CGTCGGCCTG
CTGCCGTTCT TCGGCCTGAG CGCCTTGCCG CTGATGCTGA GCTTCGTGCT GTTCATGGCC
GGCAACGCCG CCCGCACGGT CTGCCTGGCG GCCGCCATCA GCCATGTTCC GGCGCCGCCG
GAGCGCGCCG GTTTCATGGC CCTGCAGAAG ATGACCCAGG ATTTCAGCGT GGCCCTGGCC
GCCGGCGCCG CGGCGCTGGT GCTGGGTGGC GGCGACGGGC CGCTGACCCA TACCGGACTG
CTCGCCACCC TGGCGATCGT CGGTGCGGGG CTGGTCCTCT GGGTGCTCGA ACGGCTGCGG
CGAAACCTGC AGCCGCCGGC CTGA
 
Protein sequence
MNPDQSLQSE ARTKPFAAIS PKVLIALIAS LQFTYILDFM LVLPLGPDLA KGLNFHGNQV 
AWLTASYTFA SLLSGLFTVP RLDRFDRRKA LLWSLVGLAL ASLACTLAHD LPSLLLGRAV
AGLCAAPAIA TGMAILIDQT PPPQRGTAIA KVMTGFSIAT IAGIPLALEL ADHFGWQAPF
VLVAVLVVLV AFAVAHLLPP LTAHLQGPAG KPSLAMLSRP GVRLATLLQG LNQFSAFLVI
PSFSAFYLLN LDYPRQQLGT LYLVGGLVAL GAMQLAGRLG DRHGHWLPVG VASACFAVGL
LPFFGLSALP LMLSFVLFMA GNAARTVCLA AAISHVPAPP ERAGFMALQK MTQDFSVALA
AGAAALVLGG GDGPLTHTGL LATLAIVGAG LVLWVLERLR RNLQPPA