Gene Aave_2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAave_2022 
Symbol 
ID4667550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax citrulli AAC00-1 
KingdomBacteria 
Replicon accessionNC_008752 
Strand
Start bp2199332 
End bp2202760 
Gene Length3429 bp 
Protein Length1142 aa 
Translation table11 
GC content69% 
IMG OID639823233 
Producttrehalose synthase 
Protein accessionYP_970380 
Protein GI120610702 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases
[COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis 
TIGRFAM ID[TIGR02456] trehalose synthase
[TIGR02457] trehalose synthase-fused probable maltokinase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.113086 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCTC CCCTGCTGCG CCAACCCCTC CCGGCCGCGG ACGCCGCCGC CGGGCCGCTG 
CCCGAGCCGG GCCCCGTGGT GATGCCCGAA ACCCCGGAGA TCGACACGCA GGGCGACCCC
CAGTGGTACC GCGACGCCGT CATCTACCAG TTGAACGTGA AAGCGTTCTT CGACAGCAAC
AACGACGGCT ACGGCGACTT CAAGGGGGTG ACCGCCAAGC TGGACTACGT GAAGGACCTG
GGCGTCAACA CGATCTGGCT CATGCCGTTC TACCCATCGC CGCTGCGCGA CGACGGCTAC
GACATCTCCG ACTACGAGAA CGTGCATCCG CAGTACGGCA CCCTCGCGGA CTTCAAGGAG
ATGCTCGACG CCGCGCACGC GCGCGGCCTG CGCGTGATCA CCGAGCTGGT CATCAACCAC
ACGTCGTCCG AGCACCCCTG GTTCCAGCGC GCGCGGCGGG CCCCGCCCGG CTCCCCCGAG
CGGGACTTCT ACGTCTGGAG CGATACCGAC CAGATCTACC GGGGCACGCG CATCATCTTC
ACGGATACCG AAACCTCGAA CTGGGCCTGG GACCCGGTGG CCAAGCAGTA CTACTGGCAC
CGCTTCTTCA GCCACCAGCC GGACCTGAAC TTCGACAACC CGCTGGTGCT GGAAGCCGTG
TTCAAGACCA TGCGCTTCTG GCTGGACATG GGCGTGGACG GCTTCCGGCT CGACGCCATT
CCCTACCTGG TGGAGCGCGA CGGCACCAGC AACGAGAACC TGCCCGAGAC GCACGCCGTC
ATCAAGAAAC TGCGCGCGGC CATCGACGCC GAATACAGGA ACCGCTTCCT GCTCGCCGAG
GCCAACATGT GGCCCGAGGA CGTGCGCGAG TATTTCGGCG ACGGCGACGA GTGCCACATG
GCCTACCACT TCCCGCTGAT GCCGCGCATG TACATGGCGA TCGCCCAGGA AGACCGGCAC
CCCATCGTCG AGATCCTGCA GCAGACGCCC GACATCCCCG AAGGCTGCCA GTGGGCCATC
TTCCTGCGCA ACCATGACGA GCTCACGCTG GAGATGGTGA CCAGCAAGGA ACGCGACTAC
ATGTACAGCA TGTACGCGGC CGACATGCGC GCGCGCATCA ACCTGGGCAT CCGCCGCCGG
CTCGCGCCGC TGATGGAGAA CGACCTGGAC CGGGTCAAGC TCATGAACGG CATGCTGCTG
TCCATGCCGG GCTCTCCCAT CATCTACTAC GGGGACGAGA TCGGCATGGG CGACAACGTC
TTCGTGGGGG ACCGCAACGG CGTGCGCACG CCCATGCAGT GGTCGCCCGA CCGCAACGGC
GGCTTCTCGC GCTCCGACCC GCAGCGGCTC TACCTGCAGC CCATCATGGA CGCGGTGTAC
GGCTACGAGG CGCTGAACGT GGAGGCCCAG TCGGGCGACC ACAGCTCCCT GCTGCACTGG
ACGCGCCGCA TGCTGGCCGT GCGCAAGACC AGCCGCGCCT TCGGCCGCGG GCGCCGCACC
TTCCTCAAGC CCGGCAACCG CAAGATCCTG GCGTACGTGA GCGAACACGA GGACGACGTC
ATCCTCACGG TGTTCAACCT CTCGCGCGCC GCCCAGCCGG TGGAGCTGGA CCTGTCGGCC
TACCGGGGCC GCACGCCGAT CGAGATGCTG GGGCGGGTCA CCTTCCCGCC CATCGGCGAC
CTGCCCTACC TGCTCACCCT GCACTCCTAC GGCTTCTACT GGTTCCGCCT CTCCAACGAA
GCCCCGATGC CCTCGTGGCA CCAGGAGGGC CTGGACCTGC AGGAAAGGCC CGTGCTCGTG
CTGTTCGACG GGTGGACCAG CTTCTTCCGC GAACGCGTCA TGCCCTGGCG CATCGGCATG
GCCGAGCGCA TGCGCGCGCA ATTCGAGGAC GACACCCTGC CGCGCTTCAT CGAGCTGCAG
CGCTGGTATG CGGACAAGGG CGCGACCATC GCCGGCGCGC GCATCCTGGA CCACACGGTC
TGGAAGGCCG GCGAAGGCTG CGAATGGATG CTGCCGCTGC TGGGCGTGGA GCCGGCCGCG
AAGCCCGGCG GCGCCCAGGG GGCGGGCGCG ACCTACTTCG TGCCCCTGGC CCTGGCCTGG
GAAGAAGGCA GCGAGGAGCG CATGCGCCGC ATCTCTCCGG CCGGCGTGGC GCGCGTGCGC
CAGCAGGCCC AGGTGGGCGT GATGGGCGAT GCCTTCCACG ACGAGTCCTT CTGCCGCATG
GTGGTGCGTG CCATGGGCGA AGGCACCGAA CTCGCCACCG ACCAGGGCGG ACGGGTGCGA
TTCCGGCGCA CCTCGGCCTT CGACGCGCTG GCCGCGGAAC TGGACACGCT GCCCGTGGAG
CGCCCCGGCG CCCAGAGCAG CAACACCGTG GTCGCCCTCG GCGAAACCCT GTTCCTCAAG
GGCTACCGCC GCCTGCGCGA GGGCGTGAAT CCCGAACTGG AGGTCGGCCG CTTCCTCACG
GAGGTCGCCC GCTTCCCGCA CTGCGTGCCG GTGGCGGGAT CGGTGGAGTA CGTGGCGCCG
GACGGCGCCA CCATGACGCT CGCGCTGGTG CAGTCCTATG TGGCCAACCA GGGCGACGGA
TGGGAGTACA CGCTCGGCTA CCTGGAGCGC TTCCTCGAAG ACACACGCCT GGCGCAGGCG
GCCGGCCCCG TGGCGGACAT GCACGGCGGC TATCTGGCCC TGGCCGCCAC CCTGGGCCAG
CGCACGGCGC AACTGCACCA GGCGCTGGCC CGGCGCACCG GCGATGCCGC CTTCGACCCC
GAACCCGTCA CCACGCAGGA CGTGGCCGCC TTCGGCCAGC GCGCGCGCGC CGAGGCCGAG
GACACGCTCG TCCTGCTGGA GCGGCGCCTG GGCGAACTGC CGCCGGCGGC GCAGACGGAC
GCCCAGGCGG TGCTGGCGCG GCGCGCCCTC ATCATGGAGC GCCTCGCGGC CGGCGGCGCG
GACGCGCCCG CGGGCACCAA GACCCGCTTC CACGGCGACT ACCACCTGGG CCAGGTGCTG
GTGACGGGCA ATGATTTCGT CATCATCGAT TTCGAAGGCG AACCCGGCCG GCCGTTCGAG
GAACGCCGCG CCAAGAGTTC GCCCCTGCGC GACGTGGCCG GCATGCTGCG CTCGTTCAAC
TACGCGCGCT GGGCCGCCCT CAAGCACATG ACCCAGTCCA CCGAGGAGAT GGTGCGCCTG
GACGAGGCCG CGCGGCACTG GGAGCACCAG GTCCGCGATG CCTTCCTGGC CGCCTACGCG
GCGGAAGGCC TTGCGGCCGA TCCCGCCCTG ATCTCCCTGT TCGAGCTGGA GAAGGCACTC
TACGAACTCC GCTACGAACT GGGCAACCGC GTGGACTGGG CCCAGGTGCC GCTGCAAGGC
ATCCTGGCGC TGATCGGCGC GGCCGCGGCG CCCACGCCCA CGCCGAACCT GCCCGACACC
CCCGCCTGA
 
Protein sequence
MNAPLLRQPL PAADAAAGPL PEPGPVVMPE TPEIDTQGDP QWYRDAVIYQ LNVKAFFDSN 
NDGYGDFKGV TAKLDYVKDL GVNTIWLMPF YPSPLRDDGY DISDYENVHP QYGTLADFKE
MLDAAHARGL RVITELVINH TSSEHPWFQR ARRAPPGSPE RDFYVWSDTD QIYRGTRIIF
TDTETSNWAW DPVAKQYYWH RFFSHQPDLN FDNPLVLEAV FKTMRFWLDM GVDGFRLDAI
PYLVERDGTS NENLPETHAV IKKLRAAIDA EYRNRFLLAE ANMWPEDVRE YFGDGDECHM
AYHFPLMPRM YMAIAQEDRH PIVEILQQTP DIPEGCQWAI FLRNHDELTL EMVTSKERDY
MYSMYAADMR ARINLGIRRR LAPLMENDLD RVKLMNGMLL SMPGSPIIYY GDEIGMGDNV
FVGDRNGVRT PMQWSPDRNG GFSRSDPQRL YLQPIMDAVY GYEALNVEAQ SGDHSSLLHW
TRRMLAVRKT SRAFGRGRRT FLKPGNRKIL AYVSEHEDDV ILTVFNLSRA AQPVELDLSA
YRGRTPIEML GRVTFPPIGD LPYLLTLHSY GFYWFRLSNE APMPSWHQEG LDLQERPVLV
LFDGWTSFFR ERVMPWRIGM AERMRAQFED DTLPRFIELQ RWYADKGATI AGARILDHTV
WKAGEGCEWM LPLLGVEPAA KPGGAQGAGA TYFVPLALAW EEGSEERMRR ISPAGVARVR
QQAQVGVMGD AFHDESFCRM VVRAMGEGTE LATDQGGRVR FRRTSAFDAL AAELDTLPVE
RPGAQSSNTV VALGETLFLK GYRRLREGVN PELEVGRFLT EVARFPHCVP VAGSVEYVAP
DGATMTLALV QSYVANQGDG WEYTLGYLER FLEDTRLAQA AGPVADMHGG YLALAATLGQ
RTAQLHQALA RRTGDAAFDP EPVTTQDVAA FGQRARAEAE DTLVLLERRL GELPPAAQTD
AQAVLARRAL IMERLAAGGA DAPAGTKTRF HGDYHLGQVL VTGNDFVIID FEGEPGRPFE
ERRAKSSPLR DVAGMLRSFN YARWAALKHM TQSTEEMVRL DEAARHWEHQ VRDAFLAAYA
AEGLAADPAL ISLFELEKAL YELRYELGNR VDWAQVPLQG ILALIGAAAA PTPTPNLPDT
PA