Gene Avin_28000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_28000 
SymboltreS 
ID7761705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2883302 
End bp2886628 
Gene Length3327 bp 
Protein Length1108 aa 
Translation table11 
GC content68% 
IMG OID643805679 
Producttrehalose synthase, maltokinase fusion protein 
Protein accessionYP_002799947 
Protein GI226944874 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
[TIGR02456] trehalose synthase
[TIGR02457] trehalose synthase-fused probable maltokinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCAAT CGCTCAAGTC CGTCGCCTTT CTCGAGGACC CGTTCTGGTA CAAGGACGCG 
GTGATCTACC AGGTGCACGT CAAGTCGTTC TTCGACTCGA ACAACGACGG CATCGGTGAT
TTCCCGGGGC TGATCGCCAA GCTGGACTAC ATCGCCGACC TGGGGGTCAA CACCATCTGG
CTGCTGCCGT TCTACCCTTC GCCGCGTCGC GACGACGGCT ACGACATCGC CGAATACCGC
GGCGTGCATC CAGACTACGG CAGCCTCGCC GACGTGCGAC GCCTCATCGC CGAGGCGCAC
CGGCGCGGCC TGCGGTTGAT CACCGAACTG GTGATCAACC ACACCTCCGA CCAGCATCCC
TGGTTCCAGC GGGCGCGGCA GGCCAGGAAG GGCTCGGCGG CGCGCAACTT CTACGTCTGG
TCGGATAGCG ACGACAAGTA CCGGGACACC CGGATCATCT TCCTCGATAC CGAGAAGTCC
AACTGGACCT GGGACCCGGT GGCCGGCCAG TACTTCTGGC ACCGCTTCTA TTCGCACCAG
CCGGATCTCA ACTTCGACAA CCCGCAGGTG ATGAAGGCGG TGCTCGGCAT CATGCGCTAC
TGGCTCGATC TGGGCGTCGA CGGTCTGCGC CTGGACGCCA TTCCCTACCT GATCGAGCGC
GACGGCACCA ATAACGAGAA CCTGCCCGAG ACCCACCAGG TGCTCAAGCG CATCCGCGCC
GAGCTGGATG CCCGCTATCC GGACCGCATG CTGCTGGCCG AGGCCAACCA GTGGCCGGAG
GACACTCAGC TCTACTTCGG CGGCGCCCCG GAGGGGAAGG GCGACGAATG CCACATGGCC
TTCCACTTCC CGCTGATGCC GCGCATGTAC ATGGCCATCG CCCAGGAAGA CCGCTTCCCG
ATGACCGACA TCCTGCGCCA GACCCCGGAC ATCCCGTCCG ACTGCCAGTG GGCGATCTTC
CTGCGCAACC ACGACGAACT GACCCTGGAG ATGGTCACCG ACCGCGAGCG CGACTACCTG
TGGAACTTCT ACGCCGCCGA CCGCCGCGCG CGGATCAACC TGGGGATACG CCGGCGCCTG
GCGCCGCTGC TGGAGCGCGA CCGGCGGCGC ATCGAACTGC TCAACAGCCT GCTGCTGTCG
ATGCCAGGTA CGCCGGTGAT CTACTACGGC GACGAGATCG GCATGGGCGA CAACATCTAC
CTGGGCGACC GCGACGGGGT GCGCACGCCC ATGCAGTGGA GCACCGACCG CAACGGCGGC
TTTTCCCGCG CCGATCCGGC CGGACTGGTG CTGCCGCCGA TCATGGACCC GCTGTACGGC
TTCCAGAGCG TCAACGTCGA AGCCCAGCAG CGCGACAGCT ACTCGCTGCT CAACTGGACC
CGGCGCATGC TCGCCGTACG CAAGCAGCAG AAGGCCTTCG GCCGCGGCTC GCTGCGGCTG
CTGGCGCCGG CCAACCGGCG CATCTTCGCC TACCTGCGCG AATACCATGG CGCGGACGGC
AGCAGCGAGA CCATCCTCTG CGTGGCCAAC GTCTCGCGCT CGGCCCAGGC GGTGGAACTG
GAACTGCCCC AGTTCGCCGA CATGGTGCCG GTGGAGATGA CCGGCGGCAG CGCCTTCCCG
CCGATCGGCC AGTTGCCCTA CCTGCTGACC CTGCCGCCCT ACGGCTTCTG CTGGTTCCTG
CTGGCGCCCG CGTCGCAGAT GCCCAGTTGG CACATCAAGC CCACCGAAGG CATGCCGGAA
TTCCAGACCC TGGTGCTGCG GCGTCTCGAC GATCTGCTGG TGGACCCCAA CCGGCGCATC
CTGGAGAAGG AGGTGTTGCC GGCCTACCTG CCCAAGCGTC GTTGGTTCGC CGGCAAGGGC
GCGCCCTCGG GCAACGTGCA CATCGCCTAT ACGGTGACCT TCGGCAATCT GCCGCGCCCC
GTGCTGCTCA GCGAGATCGA GGTCGGCGGC GAGGGCGGCC CGCAGCGCTA CCAGTTGCCG
CTGGGTTTTC TCCCCGAGGA GGAGTTCGGC AGCGCCTTGC CCCAGCAACT GGCCATGACC
CGGGTGCGCC GTGGCCGGCG GGTGGGCCTG TTGACCGATG CCTTCACCCT GGAGGGCTTC
GTCCGCGCGA CGGTCCAGGC GCTGCGCGAG CGCCGCGTGC TGGCCTTCGA GCAGGGCGAG
CTGCGCTTTC TGCCCACCGC GCAACTGGAC AAGGTCGAGC TGCCGGCCGA CGCCGAGGTG
CGCTACATCT CCGCCGAGCA ATCCAACAGC TCGGCGGTGC TTGGCAACGC CCTGGTGCTC
AAGCTGATCC GCCGGGTGTC CGTCGGCGTG CATCCCGAAC TGGAGATGGG CCTGCACCTC
ACCGGATACG GCTTCGCCAA CATCGCCAGC GTGCTCGGCG AGGTGTGTCG GGTCGATGCC
CAGGGCGGCT GCACCGTGCT GATGATCCTG CAGCGCTACC TGGAGAACCA GGGCGACGCC
TGGGAGTGGA CGCAGAACAC CCTCGACCGG GCCATCCGCG ACGAACTGGC CGGCGGCGAG
TCGACCCTGG AGAGCCAGTA CAGCGCGCTG CACGAGCTGG AGAGCTTTTC CCGCCAGCTC
GGCCAGCGCC TCGGCGAGAT GCACATGGCC CTGGCCGCGC CCACCGAGGA CCCGGATTTC
GCTCGCGAAA CCGCCGGTCC CGAGCAAGCG GCGCAGTGGA CCGAGAGCAT CGGCGCGCAA
CTGGGGCGGG CCCTGGAAAT CCTCGCCCAG CGCCAGGCGA CGCTGGCCGA GGAGGAACGG
GCCGCCGTCG AGCAGTTGCT GCTGCAGCGT GATGCCCTGC TGGAGAAAGT CGGCGAACTG
GCCGGGGCCT GCGTCGGCAG CCTGCGCATC CGCGTGCATG GCGATCTGCA CCTCGGCCAG
GTGCTGGTGG TGCAGGGCGA CGCCTATCTG ATCGACTTCG AGGGCGAGCC GTCGCGGCCG
CTCGCCGCGC GCCGCGCCAA GCACAGCCCT TACAAGGACG TCAGCGGCGT GCTGCGCTCG
ATCGGCTACG CCGCCGACAT GGCCATGCGC AACGCGCAGA GTGTCGACAG CTCCGAGGCG
GCCGACCAGG CCCGTCGGCG CATCGCCTCG CTGTACCAGG CGAACGCCCG CCAGGCCTTC
CTCGCCGCCT ACCGCCAGGC GGCGGCCGAG ATCGGCCATG CCTGGGCCGT GCCGGACGGC
GAGGAGGCCG CGCTGGCGCT GTTCAGCCTG GAAAAGACCG CCTACGAAAT CGCCTACGAG
GCCGAAAACC GCCCCGCCTG GCTGGCCGTA CCGCTGCGGG GCCTGGCGAA TCTGGCGCAA
CAGCTACTGG ATGGAGAGGC GCAATGA
 
Protein sequence
MDQSLKSVAF LEDPFWYKDA VIYQVHVKSF FDSNNDGIGD FPGLIAKLDY IADLGVNTIW 
LLPFYPSPRR DDGYDIAEYR GVHPDYGSLA DVRRLIAEAH RRGLRLITEL VINHTSDQHP
WFQRARQARK GSAARNFYVW SDSDDKYRDT RIIFLDTEKS NWTWDPVAGQ YFWHRFYSHQ
PDLNFDNPQV MKAVLGIMRY WLDLGVDGLR LDAIPYLIER DGTNNENLPE THQVLKRIRA
ELDARYPDRM LLAEANQWPE DTQLYFGGAP EGKGDECHMA FHFPLMPRMY MAIAQEDRFP
MTDILRQTPD IPSDCQWAIF LRNHDELTLE MVTDRERDYL WNFYAADRRA RINLGIRRRL
APLLERDRRR IELLNSLLLS MPGTPVIYYG DEIGMGDNIY LGDRDGVRTP MQWSTDRNGG
FSRADPAGLV LPPIMDPLYG FQSVNVEAQQ RDSYSLLNWT RRMLAVRKQQ KAFGRGSLRL
LAPANRRIFA YLREYHGADG SSETILCVAN VSRSAQAVEL ELPQFADMVP VEMTGGSAFP
PIGQLPYLLT LPPYGFCWFL LAPASQMPSW HIKPTEGMPE FQTLVLRRLD DLLVDPNRRI
LEKEVLPAYL PKRRWFAGKG APSGNVHIAY TVTFGNLPRP VLLSEIEVGG EGGPQRYQLP
LGFLPEEEFG SALPQQLAMT RVRRGRRVGL LTDAFTLEGF VRATVQALRE RRVLAFEQGE
LRFLPTAQLD KVELPADAEV RYISAEQSNS SAVLGNALVL KLIRRVSVGV HPELEMGLHL
TGYGFANIAS VLGEVCRVDA QGGCTVLMIL QRYLENQGDA WEWTQNTLDR AIRDELAGGE
STLESQYSAL HELESFSRQL GQRLGEMHMA LAAPTEDPDF ARETAGPEQA AQWTESIGAQ
LGRALEILAQ RQATLAEEER AAVEQLLLQR DALLEKVGEL AGACVGSLRI RVHGDLHLGQ
VLVVQGDAYL IDFEGEPSRP LAARRAKHSP YKDVSGVLRS IGYAADMAMR NAQSVDSSEA
ADQARRRIAS LYQANARQAF LAAYRQAAAE IGHAWAVPDG EEAALALFSL EKTAYEIAYE
AENRPAWLAV PLRGLANLAQ QLLDGEAQ