Gene Avin_03760 details


Gene Information

Locus tag: Avin_03760
Symbol:
ID: 7759336
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 355983
End bp: 356972
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 71%
IMG OID: 643803300
Product: ABC transporter, substrate-binding protein, aliphatic sulphonate
Protein accession: YP_002797611
Protein GI: 226942538
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
            [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
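
The length fields in this record are consistent with its coordinates. The following is a minimal sketch of that check, assuming the Start and End positions are 1-based and inclusive (the page does not state this) and that the final codon of the CDS is a stop codon. The function names are illustrative only and are not part of any database API.

```python
# Sketch: recompute the length fields from the coordinates in the record.
# Assumption: Start bp and End bp are 1-based and inclusive.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(355983, 356972)
print(length)                     # 990
print(protein_length_aa(length))  # 329
```

Both values match the Gene Length and Protein Length fields above.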


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGCAAC CTTCGGACGG CCTGTCGCGC CGCCGCTTTC TCGCCGGCAG CGCCGGCGCC 
CTGGCCCTGT CGCCGCTGCT GCTCCACGGC CGGCGCGCCG GGGCGGCCAC GCCGGGCCGG
CTGCGCGTGG CCCAGTACAA GGGCGGCGAC AAGCTGCTGC TGGAGGCCGC CGGCCTCGCC
GACACGCCCT ACCCCATCGA CTGGGCGGAG TTCGCCTCGG GCAACCTGAT GGTCGAGGCG
ATGAACGGCG GTTCGCTGGA TCTCGCCTAC GGCAGCGAGA TCCCGCCGCT GTTCGGCTAC
CTCAAGGGCG CGCGCATCCG CGTGGTCGGA GTGATCAAGG GCGACGTCAA CGAACAGACG
GTGCTGGTGC CGAAGGATTC GCCGATCCGC TCCATCGCCG ATCTGAAGGG CAAGCGCGTC
GGCTACGTGC GCGCCACCAC CACCCAGTAC TACCTGACCA AGATGCTCGA CGAGGTCGGC
CTGAGCTTCG CCGACATCCA GGCGATCAAC CTCACGGTGC CCGACGGCGC CGCCGCCTTC
CGCACCGGCC AGCTCGACGC CTGGGCCATC TACGGCTATT CGGTGCCGCT GGCGCAGACC
TCGGTCGGCG CCCGGGTGCT CAAGCGCGCC AACGGCTACC TGTCGGGCAA CTATCTGTTC
TTCGCTGCGC CGGAGGCCAT CGCCGATCCG CAGCGCCAGG CGGCGATCGC CGACTATTTC
GCGCGCCTGC AGAAGGCCTT CGCCTGGCGC CAGGCCAACC ACGAACGCTA CGCCGCGGCG
CTCGCCGCGG AGATCGGCGT GCCGATCGAG GCGGTGCTCA CCCTGCTGCG CAACGAGAGC
CAGGTGCGCC GCCTGGTAGC GGTGGACGAT GAGGCGATCC GCAGCCAGCA GGACGTGGCC
GATACCTTCC ACAAGGCCGG GGTGATCGAG CGGTCGGTGG ACGTGCGTCC GCTGTGGGAC
CGCAGTTTCG CGACCGCGTT CGCCGGCTGA
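
The GC content reported in the Gene Information section can be recomputed from the gene sequence. A minimal sketch follows, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases. `gc_content` is a hypothetical helper name, and only the first line of the sequence is used as sample input, so the printed figure applies to those 60 bases and not to the whole gene.

```python
# Sketch: GC fraction of a nucleotide sequence laid out in 10-base blocks.

def gc_content(seq_text: str) -> float:
    """Fraction of G and C bases, ignoring the whitespace used for layout."""
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above (60 bases).
first_line = "ATGTCGCAAC CTTCGGACGG CCTGTCGCGC CGCCGCTTTC TCGCCGGCAG CGCCGGCGCC"
print(f"{gc_content(first_line):.0%}")  # 75% for these first 60 bases only
```

Passing the full 990 bp sequence instead of `first_line` gives the value to compare against the GC content field.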
 
Protein sequence
MSQPSDGLSR RRFLAGSAGA LALSPLLLHG RRAGAATPGR LRVAQYKGGD KLLLEAAGLA 
DTPYPIDWAE FASGNLMVEA MNGGSLDLAY GSEIPPLFGY LKGARIRVVG VIKGDVNEQT
VLVPKDSPIR SIADLKGKRV GYVRATTTQY YLTKMLDEVG LSFADIQAIN LTVPDGAAAF
RTGQLDAWAI YGYSVPLAQT SVGARVLKRA NGYLSGNYLF FAAPEAIADP QRQAAIADYF
ARLQKAFAWR QANHERYAAA LAAEIGVPIE AVLTLLRNES QVRRLVAVDD EAIRSQQDVA
DTFHKAGVIE RSVDVRPLWD RSFATAFAG
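
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is written under a few assumptions: translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in which codons may act as start codons, which does not matter here because the gene begins with ATG), the sequence is given on the coding strand, and translation stops at the first stop codon. `translate` and `CODON_TABLE` are illustrative names, not part of any library.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()  # drop layout whitespace
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence (60 bases, 20 codons).
gene_start = "ATGTCGCAAC CTTCGGACGG CCTGTCGCGC CGCCGCTTTC TCGCCGGCAG CGCCGGCGCC"
print(translate(gene_start))  # MSQPSDGLSRRRFLAGSAGA
```

The output matches the first 20 residues of the protein sequence above (MSQPSDGLSR RRFLAGSAGA). Translating the full gene sequence in the same way gives the string to compare against the complete protein.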