Gene Avin_12520 details


Gene Information

Locus tag: Avin_12520
Symbol:
ID: 7760195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 1218510
End bp: 1220123
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 71%
IMG OID: 643804155
Product: dipeptide ABC transporter, periplasmic substrate-binding component
Protein accession: YP_002798454
Protein GI: 226943381
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
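
The coordinate and length fields above agree with each other, which a short check can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the convention that reproduces the listed 1614 bp) and that the protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1218510, 1220123)
print(length, protein_length(length))  # prints: 1614 537
```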


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0296833
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGCTCC GACGCCCGAC CGCCCTCCTG CTCGCCGGCC TGCTCGCCGG CGCCGTCCAG 
GCCGCGCCGA GCAGCCTGGT GGTCTGTACC GAGGGCAGCC CGGAAGGCTT CGACATCGTC
CAGTACACCG CCGCCGTCAG CGCCGACGCC TCGGCGGAGA CGGTGTTCGA CCGCCTGCTG
CGCTTCGCTC CCGGCGGCAG CGAACTGCTC CCCGCCCTCG CCGAGCGCTG GGAGGTGTCC
GCCGACGGCC TCGAATACAC TTTCCACCTG CGCCGCGGCG TGAAATTCCA GCGCACGCCC
TGGTTCATGC CGAGCCGCGA ATTCAACGCC GACGACGTGA TCTGGAGCTT CCGGCGGCAG
ATCGATCCCG CGCATCCCTG GCACAAACTG TCGCCGCGCG GTTTCCCCTA TGCCGAATCG
ATGGCCGTGG GCGAGCTGAT AGAACGCATC GAGCGCCTCG ATGAGCACCG CGTGCGCTTC
GTCCTGCGCC ACCCGGAGGC GCCCTTCCTG GCCAACCTGG CGATGGGCTT CGCCTCCATC
TACCCGGCCG AATACGCCGA TCGACTGCTG GCCGCCGGCA CCCCGGAGCG CCTCAACAGC
CAGCCGGCGG GCAGCGGCCC CTTCATCTTC GAACGCTACG AGAAGGACGC CCAGGTGCGC
TTCCGGCGTA ACCCGGACTA CTGGGACGGC GCGCCGGCCA TCGAGCGGCT GATCTTCGCC
ATCACCCCCG AGCCCAACGT GCGGGTGCAG AAGCTCAAGG CCGGCGACTG CCAGATCGCC
CTCTACCCGC GCCCGGTGGA CCTGCCCGGC CTGCGCCGGG ACGTGCGCAT CCAGGTGCTG
GAGGACGAAC CGCTGCTGAC CGCCTACATC GGCATCAACA CCCGTCATCC GCCGCTCGAC
GACGTGCGGG TGCGCCAGGC GCTCAACCTC GCCTTCGACA AGTCCGCCTA CCTGCGCGCC
CAGTACGGCG AGGGCGGCGC CAGCCCGGCG GTGGCACCCT ACCCGCCGAG CCTGTGGGGC
TCCGACCCGA CGCTCGCCGG CTGGCCCCAC GACCCGGCGC GAGCCCGCGC CCTGCTGGCC
GAAGCCGGAC ACGCCCAAGG CCTGAAGCTG AGTATCTGGA CCCGTCCCGG CGGCGGCCCG
ACCAACCCCA ACCCCGGCAT CGGCGCCCAG TTGCTGCAGG CCGACCTGGC CGCCATCGGC
ATCCGGGCGG AAATCCGCGT GCTGGAATGG GGCGAGCTGA TCAAGCGGGC GAAGAACGGC
GAGCACGACC TGGTGTTCAT GGGCTGGGCC GGCGATAACG GCGACCCGGA CAACTTCCTG
ACCCCCAATC TGTCCTGCGC CGCGGCCGCC TCGGGCGAAA ACCAGGCCGG CTGGTGCGAC
GAGCGCTTCG ACGCCCTGTT GCGCGAGGCG CGCCGCACCA CCGACCAGGC GCAACGCACC
GCGCTCTACC GCCAGGCCCT GGCGATCTTC CACGAACAGG CGCCCTGGAT TCCCCTGGCC
CATCCCCGCG AGTTCGCCGC CGTGCGCCGC GACGTCGAGG GCTTCGTGAT CAGCCCGCTG
GGCACCAACA ACTTCGCCGG CGTGCGTCGC GCCCCGGCCA ACGCTCCGGA GTGA
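
The gene sequence is displayed in blocks of 10 bases, so the spacing has to be removed before any calculation. The sketch below shows one way to do that and to compute a GC percentage such as the one listed under Gene Information. It assumes GC content means the share of G and C bases over the full sequence length. The helper names are hypothetical, and only the first display line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence, used here as sample input.
first_line = "ATGATGCTCC GACGCCCGAC CGCCCTCCTG CTCGCCGGCC TGCTCGCCGG CGCCGTCCAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the whole gene sequence through `clean_sequence` first gives a value that can be compared with the rounded GC content reported in the record.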
 
Protein sequence
MMLRRPTALL LAGLLAGAVQ AAPSSLVVCT EGSPEGFDIV QYTAAVSADA SAETVFDRLL 
RFAPGGSELL PALAERWEVS ADGLEYTFHL RRGVKFQRTP WFMPSREFNA DDVIWSFRRQ
IDPAHPWHKL SPRGFPYAES MAVGELIERI ERLDEHRVRF VLRHPEAPFL ANLAMGFASI
YPAEYADRLL AAGTPERLNS QPAGSGPFIF ERYEKDAQVR FRRNPDYWDG APAIERLIFA
ITPEPNVRVQ KLKAGDCQIA LYPRPVDLPG LRRDVRIQVL EDEPLLTAYI GINTRHPPLD
DVRVRQALNL AFDKSAYLRA QYGEGGASPA VAPYPPSLWG SDPTLAGWPH DPARARALLA
EAGHAQGLKL SIWTRPGGGP TNPNPGIGAQ LLQADLAAIG IRAEIRVLEW GELIKRAKNG
EHDLVFMGWA GDNGDPDNFL TPNLSCAAAA SGENQAGWCD ERFDALLREA RRTTDQAQRT
ALYRQALAIF HEQAPWIPLA HPREFAAVRR DVEGFVISPL GTNNFAGVRR APANAPE
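
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below builds a codon table in plain Python rather than relying on a library. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts. Because this gene begins with ATG, the sketch does not handle alternative start codons, which is a simplifying assumption. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGATGCTCC GACGCCCGAC CGCCCTCCTG"))  # prints: MMLRRPTALL
```

The printed residues match the first block of the protein sequence shown above.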