Gene Avin_29640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_29640 
Symbol 
ID7761865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3061777 
End bp3063606 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content66% 
IMG OID643805837 
Productoligopeptide ABC transporter, periplasmic substrate binding protein 
Protein accessionYP_002800105 
Protein GI226945032 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.113024 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCCCC TCTGTCTGCT GTTGCTCAGT CTGCTCGCGA GCGGGCCGGC GATCGCGCGC 
ATCACCGAAA GCCATGGCTA CGCCCAGTTC GGCGTGCTCA AGTATCCCGT CGGTTTCCAG
CATCTCGACT GGGTCAACCC CGAGGCCCCC AAGGGGGGCA CGCTGCGCCT GATGGCCCTC
GGCACCTTCG ATACGCTGAA CCCCTACAGC TTCAAGGGAA CCAGCCCGTC GGCAACGGCC
GACTTCCTGC AGTACGGCAT CAACGAGCTG AACGAGCCGC TGATGGCCGG CTCCGGGGTC
TACGATCCCT CCGGGGACGA GCCCGCCTCC AGCTATGGCC TGATCGCCGA ATCCGTCGAA
TACAACGAAA GCCGCAGTTG GGTGGTGTTC AACCTGCGCC AGGCGGCCCG CTTCCATGAC
GGCAAGCCGA TCACGGCCCA GGACGTGGCC TTCTCCTACC GCCTGCTCAG CCAGGAGGGC
CATCCCCAGT ACCGCGCCGA GCTGCGCGAG GTGCAGCGGG TCGACGTCCT CGGCCGCCAG
CGCATCCGCT TCGTCTTCAA GCGCTCGGGC AATCCGCTGC TGATCCTGCG CCTCGGCGAG
TTGCCGGTGC TCCCCCAGCA CTACTGGAAG AACCGCGACT TCAAGGCCAC CACCTTCGAG
CCGCCCCTGG GCAGCGGCCC CTACCGCATC GTTCAGGTGC AGCCGGGGCG CCGCCTGGTG
TTCGAGCGGG TGAAGAACTG GTGGGGCGCC AAGCTGCCGA TCAACCGCGG CAAGTACAAC
TTCGACCGGG TGGATGTGGA CTTCTACCGC GACAGCGGCG TCGCCTTCGA GGCGTTCAAG
GCCGGGCAGT TCGACTTCTA TATCGAGCAC CAGGCGAAGA ACTGGGCCGG GGGCTACCAT
TTTCCGGCCG TACAGGCGGG CCAGGTGATC CACGCCGAGA TCCCGCACCG GATTCCGACC
CAGACCCAGG CGCTGTTCAT GAACACCCGC CGCAGCACCT TCGCCGATGC CAGGGTGCGC
GAGGCCCTGG GGCTGATGTT CGATTTCGAA TGGACCAATC GCGCCCTGTT CTACGGCGCC
TACCGGCGCG CCGAGAGCTA CTACCCGAAC AGCGAGTTTT CCGCCGCCGG CAAGCCGGAG
GGGGAAGAGT GGCTGCTACT TTCCAGGTAT CGCCAGCAAT TGCCCGAGCG CCTGTTCCGC
GAGCCTTTCC CGATGCCGAA GACCGACGGC CACGGCATCC CGCGCGAAAC CCTGCGCCGC
GCCCTGGCCC TGCTCGGCGC GGCCGGCTGG AAGCTCTCCG GCCAGCGACT GGTCGATGCC
CGCGGCCAGC CGCTGCGCTT CGAGATCCTG CTGGTCAATC CCAGCCTGGA GCGCATTCTC
CAGCCCTACA GCGAAAACCT CGCCGGCATC GGCATCGAGG CGCAGTTGCG TACCGTGGAT
CGCGCCCAGT ACAAGCAGCG GCTGGACCAT TTCGACTACG ACATGATCCT GCTGACCCTG
CCGCAGACCC TCAGCCCCGG CCTCGAGCAG TGGTTCTACT TCCACTCCAG CCAGATTGGC
GTGAAGGGCG GCAAGAACTA CGCGGGCATC GCCAACCCGG TGGTCGACGG CCTGCTGGAG
AGCCTGCTGG CGGCACAGAC CCGGGAACAG CAGGTCGCCG CCGTCCGCGC CCTGGATCGC
GTCCTGCTCT GGCAGCACTA CAGCATCCCC AACTGGTACA TCAATCATCA CCGCCTGGCG
TACCGCAACC GGTTCGCCTT CGTCGCCACG CCCCCCTACA CGCTGGGCCT GCGCGCCTGG
TGGCTGAAGA CCAAGGAGAA CGACCGATGA
 
Protein sequence
MRPLCLLLLS LLASGPAIAR ITESHGYAQF GVLKYPVGFQ HLDWVNPEAP KGGTLRLMAL 
GTFDTLNPYS FKGTSPSATA DFLQYGINEL NEPLMAGSGV YDPSGDEPAS SYGLIAESVE
YNESRSWVVF NLRQAARFHD GKPITAQDVA FSYRLLSQEG HPQYRAELRE VQRVDVLGRQ
RIRFVFKRSG NPLLILRLGE LPVLPQHYWK NRDFKATTFE PPLGSGPYRI VQVQPGRRLV
FERVKNWWGA KLPINRGKYN FDRVDVDFYR DSGVAFEAFK AGQFDFYIEH QAKNWAGGYH
FPAVQAGQVI HAEIPHRIPT QTQALFMNTR RSTFADARVR EALGLMFDFE WTNRALFYGA
YRRAESYYPN SEFSAAGKPE GEEWLLLSRY RQQLPERLFR EPFPMPKTDG HGIPRETLRR
ALALLGAAGW KLSGQRLVDA RGQPLRFEIL LVNPSLERIL QPYSENLAGI GIEAQLRTVD
RAQYKQRLDH FDYDMILLTL PQTLSPGLEQ WFYFHSSQIG VKGGKNYAGI ANPVVDGLLE
SLLAAQTREQ QVAAVRALDR VLLWQHYSIP NWYINHHRLA YRNRFAFVAT PPYTLGLRAW
WLKTKENDR