Gene Avin_18670 details


Gene Information

Locus tag: Avin_18670
Symbol:
ID: 7760801
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 1848053
End bp: 1849816
Gene Length: 1764 bp
Protein Length: 587 aa
Translation table: 11
GC content: 67%
IMG OID: 643804765
Product: extracellular solute-binding protein
Protein accession: YP_002799054
Protein GI: 226943981
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
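
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1848053
end_bp = 1849816
gene_length_bp = 1764
protein_length_aa = 587

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# One codon is three bases; the final codon is assumed to be the stop codon.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp, total_codons, coding_codons)  # prints: 1764 588 587
```

Under these assumptions the span matches the reported gene length, and the codon count minus the stop codon matches the reported protein length.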


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGATT ATCCGAACGA CAGCTTTTCC CTTTCCTCCC TGGTCGGTCA CTCGCCCGGC 
AAGTCCAGTC GCCCCGAAGC GCCGGCGGCA TTTGTCCCCG GCCAGTCGAA CGGCACCGCG
CCGGGCGCAC TGCGCAGTCT GGCGCGCCGC CTGGCCGGCC TGGTGATTCC CGTGGCCGCC
ACGCTCGCCC TGGCCGGCTG CTCGCCGTCG GCCGAGGACG GCCAGGCCGC CAGGACCCTG
AAGATCGCCT TCTGGGGCGA CAACACCGTG CTGGTCAGCG TCGATCCCTT CCAGGTCTAC
TGGATCGAGC ATCGCGTGCT GTTGCGCAAC GTCGCCGAAT CGCTGACCGA CCAGGACCCG
AAGACCGGCG AGATCATTCC TTGGCTGGCG AAAAGCTGGG AAGTGAGCGA CGACGCCCTG
GAGTACACCT TCCACCTGCG CGAGGACGTC ACCTTCAGTA ACGGCGAGCG TTTCGACGCC
CAGGCGGTGA AGATCGCCTT CGACAGCAAC AAGGCGTTCG CCGCCGAGGT GCCGTCGACT
TTCGGCGCCA CCTACCTGGC CGGCTACGAG CATGCCGAGG TGCTCGACGC TTTCACCGTC
AAGCTGGTGC TGTCGCGGCC CAATGCCGGT TTCCTGCAGG CCGCCTCCAC CACCAACCTG
GCGATCCTCG CGCCCGCTTC CTACCGACTG ACGGCCAGGG AGCGTTCCCT CGGCAAGATC
GTCGGCAGCG GCCCCTTCGT CCTGGAAAGC TACACCCCGG AAGTCGGCGC CAGACTGGTC
AAGCGCAAGG ACTACGCCTG GCCTTCGGCG AACCTGAAGA ACCCCGGCGC GGCGCACCTG
GACAGCGTCG AACTCAGCTA CGTGCCGGAG GAAAGCGTGC GCAACGGCCT GTTCCTGCAG
GGGCAGGTCG ACATCCTCTG GCCGCGCAAC CCTTTCTCCG AGGTGGACCT GAAGCTGTTC
CAGTCCAGGG GCGCCACCAT CCAGAGCCGT TCGCTGCCGG GGCCGGCCTT CAACCTTTAT
CCGAACGCCC AGGACAAGCG TGTCCTGGCC GACCCCAGGG TACGCCTGGC GCTGCAGAAG
GCGATCGACC GCAAGACCTA CGCCGCCACC ATCTACAACC CGGATTTTCC AGTGGTGGAC
GGGGTGTACG ACCTGACCAC GCCTTATTTC AAGACCCAGG GCGCCAAGCT GGCCTACGAC
CCGGCCGGCG CGGAGCGCCT GCTCGACGAG GCCGGCTGGG TCAAGGGCGC CGACGGCTAC
CGGCAGAAGG ACGGCAAGCG CCTGAGCCTG ACCTACATCC TGTCGCCCGC CGAAACGGCC
GGCGACGTGC TGGTTCAGGA TCAACTGCGC AAGGTCGGCA TCGAGCTGAA GCTCGACGTG
CTCACCCGCG CCGAGCGGGT CACGGCCAAC GCCGCGGGCA ACTACGACCT GACCTCCAGC
TACATGAGCC GTGCCGATCC GATCATCCTG CAGACCATTC TCGATCCGCG CACGGCCAAC
AGCGCCGCCC TGGCCAGCAA CATCTATTCC CCGCAGACCC TGGAGCGCGC CACGGCGCTG
TTCGACGCCG GCATCACCGC GACCGCCGGC GGGCAGCGCG CCCGCGCCTA TGGCGAACTG
CAGGACCTGC TGATCGACGA GGGCCTGGCC TTCCCGATCT ACGAGCGCGT CTGGCAGGCC
GCCACCGCGC CGCGCGTGCG CAACTTCCAG TGGTCCGCCG AGGGCTTCGC CTTCCTCAGC
GACATCGAGG TGGACCAGCC ATGA
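
The GC content field reports the share of G and C bases in the gene sequence. A minimal sketch of that calculation is given here; `gc_fraction` is a hypothetical helper name, and the example input is only the first row of the gene sequence, so its result is not expected to equal the whole-gene figure.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence only; paste the full sequence to check the record.
first_row = "ATGAGCGATT ATCCGAACGA CAGCTTTTCC CTTTCCTCCC TGGTCGGTCA CTCGCCCGGC"
print(f"{gc_fraction(first_row):.0%}")
```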
 
Protein sequence
MSDYPNDSFS LSSLVGHSPG KSSRPEAPAA FVPGQSNGTA PGALRSLARR LAGLVIPVAA 
TLALAGCSPS AEDGQAARTL KIAFWGDNTV LVSVDPFQVY WIEHRVLLRN VAESLTDQDP
KTGEIIPWLA KSWEVSDDAL EYTFHLREDV TFSNGERFDA QAVKIAFDSN KAFAAEVPST
FGATYLAGYE HAEVLDAFTV KLVLSRPNAG FLQAASTTNL AILAPASYRL TARERSLGKI
VGSGPFVLES YTPEVGARLV KRKDYAWPSA NLKNPGAAHL DSVELSYVPE ESVRNGLFLQ
GQVDILWPRN PFSEVDLKLF QSRGATIQSR SLPGPAFNLY PNAQDKRVLA DPRVRLALQK
AIDRKTYAAT IYNPDFPVVD GVYDLTTPYF KTQGAKLAYD PAGAERLLDE AGWVKGADGY
RQKDGKRLSL TYILSPAETA GDVLVQDQLR KVGIELKLDV LTRAERVTAN AAGNYDLTSS
YMSRADPIIL QTILDPRTAN SAALASNIYS PQTLERATAL FDAGITATAG GQRARAYGEL
QDLLIDEGLA FPIYERVWQA ATAPRVRNFQ WSAEGFAFLS DIEVDQP
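
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this with a hand-built codon table rather than a library call; `translate` is a hypothetical helper, and the amino acid assignments are the ones shared by the standard code and translation table 11 (the two differ only in which codons may act as start codons, which this sketch does not model).

```python
from itertools import product

# Codons enumerated in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence above.
first_row = "ATGAGCGATT ATCCGAACGA CAGCTTTTCC CTTTCCTCCC TGGTCGGTCA CTCGCCCGGC"
last_row = "GACATCGAGG TGGACCAGCC ATGA"

print(translate(first_row))  # prints: MSDYPNDSFSLSSLVGHSPG
print(translate(last_row))   # prints: DIEVDQP
```

Both outputs agree with the start and end of the protein sequence listed above; the last row can be translated on its own because the 1740 bases preceding it are a multiple of three, so it is in frame.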