Gene Avin_31400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_31400 
Symbol 
ID7762039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3246302 
End bp3247246 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content69% 
IMG OID643806014 
ProductABC nitrate/sulfonate/bicarbonate family transporter, periplasmic ligand binding protein 
Protein accessionYP_002800278 
Protein GI226945205 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0670501 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTTTC CGCGTTTTCT GCGCAACGGC CTGGCCGGCC TGCTGCTGGT CGCGCCCCTG 
AGCCAGGCCG CCGACGCCGT GCTGCGCATC GGCGACCAGA ACTACTACAA CGTGCGCGCC
TCGCTGGAGG CCTCCGGCGC GCTGGAGGGC GCCCCCTACC AGGTCGAATG GAAGCACTTC
CAGTCCGCCG CGCCGCTGGC CGAGGGACTG GACGCCGGGG CGCTGGACCT CGGCTTTCTC
GGCGACTCGG GATTCATCTT CCTCGCCGCC AAGGGTGCGC CGGTCAAGCT GATCGGCATC
TCCCGGCAGA ACCCGGACAC CATCGCCCTG CTGGTGCCCA AGGACTCGCC GGCCAAGGGC
ATCGAGGATC TCAAGGGCAA GAAGGTCGCC TACTGGCCGG GCGCCTGGAG CCAGCAACTG
ACCCTGCGTG CCCTGCAGAA GGCCGGCCTG CCCGGCGATT ACGTCGAGTT CGTCAAACTG
ATGCCGATCG ACGCCGCCGC CGCGCTGCCG CGGGGCAGCA TCGACGCCTT CCCGGTGTGG
GAGCCGTACA TTTCCCAGCA GATCCTCTTC TCCGGCGCGC GCCCGCTGCT CACCTCCAAG
GGCCTGATGC CGGGACTTTC CAGCATCGCC GCCAACGCCG CGTCCGTCGA GCCCAAGCGC
GCCGCCATCG CCGATTTCCT CGGCCGCCTC AAGCAGGCGC GCGCCTGGGT CGAGACACAC
AAGAGCGAGT ACGCCGAGCT CTGGGCGAAG AAGGCCAACC TCGACCCGGA GGTATCCCGC
CACTGGATCG GCCAGGCCGA CATGACCGTG GGCCCGGTGG ACGACCAGGC CGCCCGCGAC
TATCAGGAAA CCGCCGACTT CCTGCGGGAA ACCGGCGCCC TGCCCAAGGC CTTCAAGGTC
GACACGGTGA TCGATTCCTC CTTCGCCCGA ACGCTGCAAC CCTGA
 
Protein sequence
MKFPRFLRNG LAGLLLVAPL SQAADAVLRI GDQNYYNVRA SLEASGALEG APYQVEWKHF 
QSAAPLAEGL DAGALDLGFL GDSGFIFLAA KGAPVKLIGI SRQNPDTIAL LVPKDSPAKG
IEDLKGKKVA YWPGAWSQQL TLRALQKAGL PGDYVEFVKL MPIDAAAALP RGSIDAFPVW
EPYISQQILF SGARPLLTSK GLMPGLSSIA ANAASVEPKR AAIADFLGRL KQARAWVETH
KSEYAELWAK KANLDPEVSR HWIGQADMTV GPVDDQAARD YQETADFLRE TGALPKAFKV
DTVIDSSFAR TLQP