Gene Aave_2355 details


Gene Information

Locus tag: Aave_2355
Symbol:
ID: 4669792
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidovorax citrulli AAC00-1
Kingdom: Bacteria
Replicon accession: NC_008752
Strand:
Start bp: 2574196
End bp: 2576535
Gene Length: 2340 bp
Protein Length: 779 aa
Translation table: 11
GC content: 66%
IMG OID: 639823558
Product: hypothetical protein
Protein accession: YP_970704
Protein GI: 120611026
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3524] Capsule polysaccharide export protein
TIGRFAM ID:
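
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2574196
END_BP = 2576535
GENE_LENGTH_BP = 2340
PROTEIN_LENGTH_AA = 779


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes an unspliced, complete CDS whose length is a multiple of three.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length_bp // 3 - 1


# Both comparisons should hold for an internally consistent record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2576535 - 2574196 + 1 = 2340 bp, and 2340 / 3 - 1 = 779 aa, matching the listed gene and protein lengths.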


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.512259
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000418473
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGTCCGCTC TTGAGGTGCG CGTCCATCGC CGCACGGGGG TGGGCGAGCA CATCCTGAAC 
GATGCGCCGC TGGAGTTCGA CCAGGACGAG GACGGGAAAG AGCCGCATCC GCTGGACGGG
CCGGAAGCGC GGGCGACCCT GAAGAAGTGC CTGGGCTGGT ACTACCGCGA GCGCGACATT
CAGGCCGTGA ACCGCATGGA CATGGCGATC GACGCCGATA TGTACGACGG CGACCAGTGG
GACCCCGCCG ACGCGGCGGT GCTGGAGGAG CGCGGCCAGG TGCCCCTGGT GTTCAACGAA
GTGGCGCCGA TGGCGGACTG GATGATCGGC ACGGAGCGCC GGTCGCGCGT GGACTGGAGC
GTCCTGCCGC GCACCGAGGA CGACGTGCAG ATGGCCGACG TGAAGACCAA GGTGATGAAG
TACGTCACCG ACGTGAACCG GTCCGCCTTC AACCGCTCGC GCGCCTTCGA GGACGCTGTG
AAAGTCGGTA TCGGCTGGGT CGATTCCGGC GTGCGCAACG ACCCGACCAA GGACATCATT
TACGACAAGT ACGAGGACTG GCGGAACGTG CTGTGGGACT CCATGGCCAT CGAGATGGAC
CTGAGCGATG CCCGCTACGT CTTCCGCACG CGATGGGTGG ATGACGATGT GGCCGCGGCC
ATGTACCCGG AGCGGGCAGA CGTGGTGCGC CGGGCCGTGC AGCATGACCG CGACTACAAC
GCCCAGCAGT GGGCGGAAGA CGAATTCAAC TACCAGGGCT ATGCGGGCTC GACGCGCAGC
GGCAGCTACA TGGCCAGCGG GCAAAGCTCG GCGGACAGTG AGCCGCGCCG CAGGGTGCGG
CTGATCGAAT GCCAGTTCCG CATGCCCGTG CAGGCGCGCG TGGTGGTCAC CGGCCCGTTC
AAGGGCTCGA TCGTAGAACC CTGGGACAAC GCCCTGCAGG CGGTGGTGGC CGCATTCGGG
GGCTCCGTCG TGGACCGCGT GCTGATGCGG ATGCACATCG CCGTCTTCAC GGAAGGCCAC
CTCCTAGCGC TGGGCCCGAT GCCAATGCGG CACAACAGTT TCAGCCTGAC CCCGATCTGG
TGCTACCGCC GCGGCCGAGA CCGGCAGCCT TACGGCGTGA TCCGCCGCGT GCGGGATCTG
CAGGCCGACC TGAACAAGCG CGCGAGCAAG GCGCTGTTCG CTCTGAGCAC GAACCAGATT
TTTGCCGAGC ACGGCGCGGT GGACGACATC AATGAGACGC GCGAGGAGGC AAACCAGCCG
GACGGCGTGA TCCTCTACAA GGCGGGGAAG AAGCTGGAGG TCCACCGGGA CTCCGAGATG
GCGGCGGGAC AGGTGCAGAT GATGACCATG GACGCCCAGG CCATCCAGCG CAACGCGGGC
ATCTCAAACG AGAACCTGGG CCGGCAGACC AACGCAAGCA GCGGCGAGGC GATCAAGGCC
CGGCAGATGC AGGGAAGCGT GGTCACGACC CAGCCCTTCG ACAACTTGCG GTTCGCGACC
CAGACCCAAG GCGAGAAGCT GCTGTCTCTC ATCGAGCAGT GGTACACGGA AGAGAAGGTC
ATCCGGCTGT CCGGCCACAA GGGAAAGCTC GACTGGGTGA AGATCAATCA GCCCGAGCAG
CAGGCCGACG GTTCGGTGCG CTACCTGAAC GACATCACGG CCAGCGTTGC GGATTTCGTG
GTGTCCGAAC AGGACTATGC CGGCACGCTG CGCCAGGTGA TGTACGAGTC CATGGTGAAC
TTGGCCGGCC GCATGGATCC CGCGACGGCC ATGCGCCTGA TGACGCTGGC CATGGACTAC
TCGGACCTGC CCAACCACGA GCAGATGGCT GCCGAAATGC GCAAGCTGAC CGGCGAGCGC
GACCCGAATA AGCCCCTCAC GCCCGAGGAG CAGCAGCAAA TGCAGCAGCA GATGCAGGCC
CAGGCGGAGG CGCTGCAGAT GCAGCAGGCC ACGGCGCGTG CGGCGCTGGA CGAGCAGTTG
GCACGCGTCC GCGAGGTCAA CGCCCGCGCC CAGAAGATGG AAGCAGAGGC CGAGCAGCTG
CGTGCTGGCG GGGATGGAGC CCAGGCGCAG CAACTGGAGG GCGTGGCCGC CACGGTCCGC
CGCGATGCGG ACCTGGAGCT GGAAAACGTG CGCCGCCAGC TCGCCAAGGC GCAGGCCGAC
CTGGCCAACA AAACCCTGCA GATCAAGGCT GACGGCGACG TGCGCATGCA GGTTGCCCGG
ATCGAAGCCG ATTCGCGCGA GCGTGTCGCC GAGATCCAGG CGGCGAGCAA GGAGCGGCTG
CAAGCCATGG ACGAACGGCT GGCCGCATTC GAGGCGGCGC CTCAGGAGAA AACTGCATGA
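
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be computed. The following is a minimal sketch of that clean-up, not the method the database itself uses; `clean_sequence` and `gc_fraction` are hypothetical helper names.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-formatted sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in ("G", "C") for base in seq) / len(seq)


# First line of the gene sequence, as printed in this record.
first_line = "ATGTCCGCTC TTGAGGTGCG CGTCCATCGC CGCACGGGGG TGGGCGAGCA CATCCTGAAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the full gene sequence, the joined string should be 2340 bases long, and the GC fraction can be compared against the 66% reported in the Gene Information fields.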
 
Protein sequence
MSALEVRVHR RTGVGEHILN DAPLEFDQDE DGKEPHPLDG PEARATLKKC LGWYYRERDI 
QAVNRMDMAI DADMYDGDQW DPADAAVLEE RGQVPLVFNE VAPMADWMIG TERRSRVDWS
VLPRTEDDVQ MADVKTKVMK YVTDVNRSAF NRSRAFEDAV KVGIGWVDSG VRNDPTKDII
YDKYEDWRNV LWDSMAIEMD LSDARYVFRT RWVDDDVAAA MYPERADVVR RAVQHDRDYN
AQQWAEDEFN YQGYAGSTRS GSYMASGQSS ADSEPRRRVR LIECQFRMPV QARVVVTGPF
KGSIVEPWDN ALQAVVAAFG GSVVDRVLMR MHIAVFTEGH LLALGPMPMR HNSFSLTPIW
CYRRGRDRQP YGVIRRVRDL QADLNKRASK ALFALSTNQI FAEHGAVDDI NETREEANQP
DGVILYKAGK KLEVHRDSEM AAGQVQMMTM DAQAIQRNAG ISNENLGRQT NASSGEAIKA
RQMQGSVVTT QPFDNLRFAT QTQGEKLLSL IEQWYTEEKV IRLSGHKGKL DWVKINQPEQ
QADGSVRYLN DITASVADFV VSEQDYAGTL RQVMYESMVN LAGRMDPATA MRLMTLAMDY
SDLPNHEQMA AEMRKLTGER DPNKPLTPEE QQQMQQQMQA QAEALQMQQA TARAALDEQL
ARVREVNARA QKMEAEAEQL RAGGDGAQAQ QLEGVAATVR RDADLELENV RRQLAKAQAD
LANKTLQIKA DGDVRMQVAR IEADSRERVA EIQAASKERL QAMDERLAAF EAAPQEKTA