Gene Avin_18040 details

Gene Information

Locus tag: Avin_18040
Symbol:
ID: 7760739
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 1794188
End bp: 1795558
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 66%
IMG OID: 643804703
Product: dicarboxylate or citrate transporter (MFS superfamily)
Protein accession: YP_002798992
Protein GI: 226943919
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00883] metabolite-proton symporter
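
The coordinates and lengths in this record are mutually consistent, which a short check can confirm. The following is a minimal sketch in Python; the variable names are illustrative and not part of the record, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information record.
start_bp = 1794188
end_bp = 1795558
gene_length_bp = 1371
protein_length_aa = 456

# Assumption: coordinates are 1-based and inclusive.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that encodes no residue.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)        # gene length check
print(computed_protein_length == protein_length_aa)  # protein length check
```

Both comparisons hold for the values listed here: the span covers 1371 bp, which is 457 codons, or 456 residues plus the stop codon.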


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0280762
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAATCC CCAGCAGTCC CGACCTGTGC GCCACTGCGG ACCTTGCCGC CGACCGTTCC 
GACGCAACCC CGTCCGGAGC CGCCGAAAAG ACCGAATACA CGGCCGCCGA ACGCCGCCAG
CGGATCTTCG CCATCGTCGG CGCCTCCTCC GGCAACCTGG TGGAATGGTT CGACTTCTAC
GTCTACGCCT TCTGCGCCAT CTACTTCGCC CCGGCCTTCT TCCCCACGGC CGACCCCACC
GTCCAGTTGC TCAACACCGC CGGGGTGTTC GCCGCCGGCT TCCTGATGCG CCCGATCGGC
GGCTGGCTGT TCGGCCGGAT CGCCGACAAG CACGGCCGCA AGACCTCGAT GCTGATCTCG
GTGCTGATGA TGTGCGCCGG CTCGCTGGTG ATCGCCTTCC TGCCGACCTA CGAGAGCATC
GGCGTGGCCG CGCCGGCCCT GCTGCTGTTC TGCCGCCTGT TCCAGGGGCT TTCGGTCGGC
GGCGAGTACG GTACCACCGC GACCTACATG AGCGAGGTGG CGCTCAAGGG CAAGCGCGGC
TTCTACTCCT CGTTCCAGTA CGTCACCCTG ATCGGCGGCC AGTTGCTGGC GGTGCTGGTG
GTAGTGATCC TGCAGCAACT GCTCAGCACC GATGAACTGA AGGCCTGGGG CTGGCGCATT
CCGTTCGTGA TCGGCGCCAT CGCGGCGGTG ATCGCCCTGT TGCTGCGCCG TTCCCTGGAG
GAAACCACCA CGGCGGAAAG CCGCGCCAGC AAGGAGGCGG GCAGCATGGC CGGCCTGTTC
AAGCACCACA AGGCCGCCTT CATCACCGTG CTCGGCTACA CCGCTGGCGG CTCGCTGATG
TTCTACACCT TCACCACCTA CATGCAGAAG TACCTGGTCA ACACGGCGGG CATGGACGCC
AAGACCGCCA GCGGCATCAT GACCTTCGCG CTGTTCTGCT ACATGCTGAT GCAGCCGCTG
TTCGGCGCCC TGTCCGACCG CATCGGCCGG CGTACCTCGA TGCTCTGCTT CGCCGCCCTG
GGCGCGCTGT GCACCCTGCC GATCCTGGCG ACCCTGAAGG GCATCGGCAG TCCCGCCCTG
GCCGGCGCGC TGATCATCCT GGGGATGGCC ATCGTCAGCT TCTACACCTC GATCGGCGGC
ATCGTGAAGG CCGAGATGTT CCCGCCGGAG GTGCGCGCGC TGGGCGTCGG CCTGTCCTAC
GCCATCGCCA ACGCCCTGTT CGGCGGCACC GCCGAATACG TGGCGCTCGG CCTGAAGTCG
ATCGGCCATG AAGAGGTCTT CTACTGGTAC GTGACGGGCA TGCTGGTGAT CGCCTTCCTG
TTCAGCCTGC GCCTGCCGAA GCAGGCCGCC TACCTGCACC ACGACCGCTG A
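
The GC content reported in the Gene Information section can be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is pasted as text with its 10-base blocks and line breaks intact; `gc_content` is a hypothetical helper name, and only the first line of the sequence is used here for brevity.

```python
def gc_content(seq_text: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces between 10-base blocks and the line breaks)
    is removed before counting.
    """
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; paste the full sequence to
# compare the result against the GC content listed in the record.
first_line = "ATGACAATCC CCAGCAGTCC CGACCTGTGC GCCACTGCGG ACCTTGCCGC CGACCGTTCC"
print(f"{gc_content(first_line):.1f}%")
```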
 
Protein sequence
MTIPSSPDLC ATADLAADRS DATPSGAAEK TEYTAAERRQ RIFAIVGASS GNLVEWFDFY 
VYAFCAIYFA PAFFPTADPT VQLLNTAGVF AAGFLMRPIG GWLFGRIADK HGRKTSMLIS
VLMMCAGSLV IAFLPTYESI GVAAPALLLF CRLFQGLSVG GEYGTTATYM SEVALKGKRG
FYSSFQYVTL IGGQLLAVLV VVILQQLLST DELKAWGWRI PFVIGAIAAV IALLLRRSLE
ETTTAESRAS KEAGSMAGLF KHHKAAFITV LGYTAGGSLM FYTFTTYMQK YLVNTAGMDA
KTASGIMTFA LFCYMLMQPL FGALSDRIGR RTSMLCFAAL GALCTLPILA TLKGIGSPAL
AGALIILGMA IVSFYTSIGG IVKAEMFPPE VRALGVGLSY AIANALFGGT AEYVALGLKS
IGHEEVFYWY VTGMLVIAFL FSLRLPKQAA YLHHDR
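
The protein sequence is the translation of the gene sequence under translation table 11. A minimal sketch follows, assuming table 11 assigns amino acids to codons as the standard code does (it differs in the permitted start codons, which this sketch does not model); `translate` and `CODON_TABLE` are illustrative names, not part of the record.

```python
from itertools import product

# Codon-to-residue assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()  # drop block spacing and newlines
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGACAATCC CCAGCAGTCC CGACCTGTGC"))  # MTIPSSPDLC
```

The output matches the first 10-residue block of the protein sequence. Passing the full gene sequence should yield the full protein under the same assumptions.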