Gene Avin_29650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_29650 
Symbol 
ID7761866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3063603 
End bp3065444 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content65% 
IMG OID643805838 
Productoligopeptide ABC transporter, periplasmic substrate binding protein 
Protein accessionYP_002800106 
Protein GI226945033 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000703906 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCATT CCCTGCGCAC CCTGCTCCAC GGCGCCGGCC TGTTGTTGCT CGGTTTCGCC 
GGCCTGGTCC AGGCAGACCC GCAGCACGCC ATCACCCTGT ACGACGAGCC GCCCAAGTAC
CCGGCCGACT ACCGGCACTT CGAGTACGTC AATCCGGACG CACCGAAGGG AGGCACGCTG
CGCCTGGCCG ACTACGGTGG CTTCGACAGC CTCAACCCCT TCATTCCCAA GGGCAACGTG
GAACGCCGCA TCGGCATGGT CTACGACAGC CTGACCTATC ATGCGCAGGA CGAGCCCTTC
ACCGAATACG GCCTCATCGC CGAGAAGATC GAGAAGGCCC CGGACAACGG CTTCGTGCGC
TTCTACATCA ATCCCAAGGC GCGCTTCCAC GACGGCCGGC CGATCACCGC CGAGGATGTG
AAATTCACCT TCGAGACCCT GATCGAGCAC GGCGACCCGA TGTACCGCCA CTACTACGCG
GACGTCGCCC AGGTGGTGGT CGAGGAGCCG CTGAAGGTAC GCTTCGACTT CAAGCACCGC
GACAACCGCG AGCTGCCGCT GATCCTCGGC CAGTTGCAGA TCCTGCCCAA GCACTGGTGG
GAAAGCCGCG ACTTCGCCAA GACCAGCCTG GAAGCGCCGC TCGGCAGCGG GCCGTACCGT
GTCGCCAAGC TGGAGTCCGG TCGCTCCATC CGCTACGAGC GGGTCGCGGA CTGGTGGGCC
AAGGACCTGC CGGTCTCCCG CGGCCAGTAC AACTTCGACG CCATCGTCGT CGACTACTAC
CGCGACATGT CGGTCGCCCT GGAAGCCTTC AAGGGCGGGC AGTTCGACCT CAACCTCGAA
TACTCCGCCA AGGATTGGGC CACCGGCTAC GAATCCGCCG CCCTCAATGA CGGCCGGATG
ATCAAGAAGG CCGTCCCCAA CCACAACCCG GTCGGCATGC AGGCCTTCGC CTTCAACATC
CGCCGGCCCC TCTTCCAGGA CCGCCGCGTG CGCGAGGCGC TCGGCCTGCT GTTCGACTTC
GAATGGTCGA ACAAGCAACT GTTCTTCAGT TCCTACAAGC GCACCAGCAG CTACTTCGAG
AACTCGGAAA TGGCCGCCCA CCAGTTGCCC GACAAGGAAG AGCTGAAGAT TCTCGAACCC
TTGCGCGAAC AGTTGCCGCC GGAGGTGTTC AGCGAGGTCT ACCGGCCGCC GGTGACCAAC
GGCGACGGCA TCATCCGCGA CCAGAAGCGC CGTGCCTACC AGTTGCTCCA GGAAGCCGGC
TACCGCATCG AGAACGACCG CATGGTCGGC CCCGACGGCA AGCCGCTGGC CTTCGAATTC
ATGCTCCACC AGACCAACCT GGAACGCATC CTGCTGCCCT ACAAGCGCAA CCTCGGCGAA
CTCGGCATCG ACATGCAGAT CCGCCGCGTC GACGTTCCCC AGTACATCAA CCGCATGCGC
AACCGCGACT TCGACATGAC CAGCGCCACC TGGCCGCAGT CCAACTCGCC GGGCAACGAG
CAGCGCGAGT TCTGGCACTC CAGCAGCGCC GACAACCCCG GCAGCCGCAA CTTCATCGGC
CTGCGCGACC CGGCCGTCGA CCGGCTGGTC GACGGGCTGA TCCGCGCCGA CTCGCGGAAA
GGCCTGGTCG CCCACGCCCG CGCCCTCGAC CGTGCCCTGC AGTGGGGCTT CTACGTGGTG
CCCAACTACC ATGTGAACAC TTGGCGCATC GCCTACTGGA ACAGGTTCGG CCAGCCGCAG
AAGACGCCGC TGTACGACTA CGGCCTGATG ACCTGGTGGC AGGACAGCGA CAAGCCGCAG
CCCCGGGACG AAGCGGTGGC GCACAAGGAG GAAGGTCGAT AG
 
Protein sequence
MTHSLRTLLH GAGLLLLGFA GLVQADPQHA ITLYDEPPKY PADYRHFEYV NPDAPKGGTL 
RLADYGGFDS LNPFIPKGNV ERRIGMVYDS LTYHAQDEPF TEYGLIAEKI EKAPDNGFVR
FYINPKARFH DGRPITAEDV KFTFETLIEH GDPMYRHYYA DVAQVVVEEP LKVRFDFKHR
DNRELPLILG QLQILPKHWW ESRDFAKTSL EAPLGSGPYR VAKLESGRSI RYERVADWWA
KDLPVSRGQY NFDAIVVDYY RDMSVALEAF KGGQFDLNLE YSAKDWATGY ESAALNDGRM
IKKAVPNHNP VGMQAFAFNI RRPLFQDRRV REALGLLFDF EWSNKQLFFS SYKRTSSYFE
NSEMAAHQLP DKEELKILEP LREQLPPEVF SEVYRPPVTN GDGIIRDQKR RAYQLLQEAG
YRIENDRMVG PDGKPLAFEF MLHQTNLERI LLPYKRNLGE LGIDMQIRRV DVPQYINRMR
NRDFDMTSAT WPQSNSPGNE QREFWHSSSA DNPGSRNFIG LRDPAVDRLV DGLIRADSRK
GLVAHARALD RALQWGFYVV PNYHVNTWRI AYWNRFGQPQ KTPLYDYGLM TWWQDSDKPQ
PRDEAVAHKE EGR