Gene Avin_31680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_31680 
Symbol 
ID7762068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3277533 
End bp3278507 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content70% 
IMG OID643806042 
ProductABC transporter, aliphatic sulfonate substrate-binding protein 
Protein accessionYP_002800306 
Protein GI226945233 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCCGT TCGTCTCTCT CAGCCGCCGC GCTCTGCTCG GTCTCGGCCT CGCCCTGGGG 
CTGTGCGCCG CCCTGCCCGC CCAGGCGGAA ACCGAGCTGC GCATCGGCTA CCAGAAATCC
TCCACCCTGA TCGTCCTGCT GAAGGCCCGC GGCACCCTGG AAAAGACCTT GTCCGCCCAG
GGTATCCGCC TCAGTTGGCA CGAGTTCACC AGTGGCCAGC CGCTGCTGGA GGCGCTCAAC
GTCGGCAACC TGGACCTGAC CGCCGATGTC GCCGATACCG TGCCGGTGTT CGCCCAGGCC
GCCGGCGCCC ATCTCGCCTA TTTCGCCCAG GAGGCGCCAT CGCCGGCCGC CCAGGCGATC
CTGGTGCGCG CCGACTCGCC GCTGCGCGGT CTGGCCGATC TCAAGGGCAA AAGGGTGGCG
GTGACCAAGG CCGCCGGCAG CCACTACCTG CTGCTCGCCG CACTGGCCGA GGCCGGTCTG
AAGTTCTCCG ACATCGAGCC GGCCTACCTG ACCCCGGCCG ACGGCCGCGC CGCTTTCGAG
AATGCCAAGG TGGACGCCTG GGTGACCTGG GAACCCTTCC TCAGCGGCGC CCAGCGCCAG
TTGCCGACCC GCACCCTGGC CGACGGCGAG AAGCTGGCCG CCTACCAGCG CTACTACCTG
ACCAGCCAGC GCTTCGCCAA GGAGCACCCG CAGGTGCTGG AGGCGGTGTT CGCCGAGCTG
GTCAAGGCCG GCGACTGGCT GCGCGCCAAT CCCCGGGAAG CCGCACGGAT TCTCGCGCCG
CTATGGGGCA ACCTGGACCC GGCGATCGTC GAACAGGCCA ACGCCCGACG CAGCTACCGG
GTACGTCCGG TACAGCTGGA GAGCCTGGCC GAGCAGCAGA AGATCGCCGA CGCCTTTTTC
GCCGAAGGGC TGCTGCCGAA GCAGGTCGAC GCCCGCGACG TGTCCATCTG GCAACCGCAG
ACGGCCGCCC GCTGA
 
Protein sequence
MPPFVSLSRR ALLGLGLALG LCAALPAQAE TELRIGYQKS STLIVLLKAR GTLEKTLSAQ 
GIRLSWHEFT SGQPLLEALN VGNLDLTADV ADTVPVFAQA AGAHLAYFAQ EAPSPAAQAI
LVRADSPLRG LADLKGKRVA VTKAAGSHYL LLAALAEAGL KFSDIEPAYL TPADGRAAFE
NAKVDAWVTW EPFLSGAQRQ LPTRTLADGE KLAAYQRYYL TSQRFAKEHP QVLEAVFAEL
VKAGDWLRAN PREAARILAP LWGNLDPAIV EQANARRSYR VRPVQLESLA EQQKIADAFF
AEGLLPKQVD ARDVSIWQPQ TAAR