Gene Aave_3355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAave_3355 
Symbol 
ID4665755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax citrulli AAC00-1 
KingdomBacteria 
Replicon accessionNC_008752 
Strand
Start bp3708505 
End bp3711528 
Gene Length3024 bp 
Protein Length1007 aa 
Translation table11 
GC content59% 
IMG OID639824551 
Producthypothetical protein 
Protein accessionYP_971687 
Protein GI120612009 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.18974 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGCA CCAGTGAAGT CGCATTCGAA ACCGCCATCG AGGCGGTCTT GCTGAGTGAA 
GGATATACGC GGGTCGACGT CAAAGGCTTC GACCGCGAGC GCGCGATTTT CCTCGATGAG
GCACTGGCCT TCATCCGTGC CACCCAGGGC AAGGTGTGGG AGAAGCTAGA AGCCCTGCAC
GGCGAGCAGA CCGGTGCGCG CGTGCTGGAG TCGCTATGCA AGTGGCTGGA TACCCATGGC
GCCCTGGCCA CGCTGCGCCA CGGTTTCAAA TGCTTCGGGC GCACGCTGCG GATAGCCTTC
TTTCGCCCCG CCCACGGCCT GAACCCGGAA CTGGAGGCGC GCTACCAGGC CAACCGGCTG
GGTCTGACCC GCCAATTGCA TTTCAGCCCG AAGTCTGAGA AGTCTCTGGA TGTAGTGTTG
TCGGTCAACG GCATTCCGGT GGTGACGCTG GAACTCAAGA ACCCCTTGAG CGGCCAGACG
GCAGCGAATG CCATCCACCA GTATCGCCAC GACCGCGATC CGCGTGAACC AATCTTCGAG
TTCACCAAGC GCGCGCTGGT GCACTTCGCT GTAGATACTG AAGAAGCACA CATGGCCACT
CGCCTGGCCG GTTCGTCCAC TTACTTCCTG CCGTTCAACC GGGGCATGGA CGGTGGCGCC
GGTAACCCGC CCGATCGTGA AGGGCGCAAC TACAAGACAG CCTACCTGTG GGAAGAGGTG
CTACAGCGCG ACAGCCTGCT CGACCTGCTC GCCCGTTTCC TGCATCTCGA TGTGGAAGAA
AAGACCACTA ACGTCGGTAA GAAGGTGCGC AAGGAAAGCC TGATCTTTCC GCGTTATCAC
CAGTTGCAGG CTGTGCGCCG GATGGTGGCG GCCGCCGCCA GCGAGGGCGC GGGGCATAAC
TACTTGGTCG AGCATTCGGC CGGCAGCGGC AAGAGCAACA CGATTGCCTG GCTGGCGCAC
CGTCTGTCCA GCCTGCACAA CGAGCGCGAC GAGCGACTGT TTGACAGCGT GGTGGTCATC
ACCGACCGGG TGGTGCTCGA CCGCCAGTTG CAGAACACCA TCTACCAGTT TGACCACCGC
CAGGGCGTTG TGCAGAAGAT CGATGAGGAC TCGCGCCAGC TCGCCGAAGC GCTGGAAGCC
GGCGTGCCAA TCATCATCAC CACGCTGCAA AAATTTCCGT TCGTGTCCGG GCAACTAGCC
AAGCTTAGTG AGGAGCGTGG CGAAGGCAGC AAGAGCCATC TGCCCACGCG CAAGTACGCT
GTAATCATCG ACGAGGCGCA CAGTTCGCAA TCGGGCGAGA CGGCCTCGGA GTTGAAAGGT
GTGCTGGGCG GTGCCGAGCT GCGCCGCAAG GCACAAGAGA TGGCCGAGGA GGAAGGCGAA
GTCGAACTGG AGCGACTGTT CCGTTCCATG GCCAAGAGGG GCCATCAACC GAACATGAGC
TTTTTCGCTT TCACCGCCAC GCCCAAACAC AAGACATTGG CGATCTTTGG CCGGGGCGGC
GAGCCCTTCC ACCGCTACAC CATGCGCCAG GCCATCGAGG AGGGCTTTAT CGAGGATGTG
CTCAAGAGCT ACGTCACCTA CAAGACCTAC TACAAGCTGA TCAAGAAGGC CGAGGACGAC
CCCAACGTCG AGCGCAAGAA GGCGGCCAAG GCGTTGGCCC GTTTCATGCG GCTGCATCCG
CACAACATTG GCCAGAAGAC CGAGGTGATG GTCGAGCATT TCCAACACTT CACGCGGCAC
AAGATCGGCG GCCATGCCAA GGCGATGGTG GTGACCGGCT CGCGACTGGA AGCGGTGCGC
TACAAGCAGG AGTTCGACCG CTACATTCAG GAGAAGGGCT ACCCCATCAA GAGCCTGGTG
GCGTTCTCGG GCACGGTGGA AGACGACAAG ATTCCGGAGA AGTCATACAC GGAAGTCGAG
ATGAATGGCG GCCTGAAGGA GAAGGAACTG CCAGACACCT TCGCCAAACC GGAATTCCGC
GTGCTGCTGG TGGCCGAGAA ATACCAGACC GGCTTCGACC AGCCGTTGTT GCACACCATG
TACGTGGACA AACGGTTGGC GGGCATTCAA GCCGTTCAGA CCTTGTCGCG CTTGAACCGA
ACCCACCCAC TCAAGGACGA TACCTTCGTG CTCGATTTCG TCAACGATCC CGACGAAATC
CAGGAGGCCT TCCGTCAATA CTACGACGGC TCGGTGATGG GCGAGCAGGT AGATCCAGAC
AAGCTCTACG AGGTGAAGGC CGAACTGGAT GCCTCGGGCA TTTACCTGCA AACGGAGATT
GTCGAGTTTG CGCATGTGTT CTTTGCGCCT AAGCGCCGCC AGAGCCCTGG TGACCATAAG
CTGATGAATG CGACTCTTGA TCTGGCGGTC GCGCGCTTTG TGCGGTTGCA GAACACCGAA
GAAGACGAAG CGGAGTTGTG GCGCAGCAAG TTGCAGGCCT TTCGCAATCT GTACGGTTTC
CTGAGCCAGG TGATTCCCTA TCAGGACAGC GATCTGGAAA AACTCTTCAC CTACCTGCGC
CATCTCGCGC TGAAGTTGCC CAAGCGCAAG AGCGGGCCGG GCTATCAGTT CGACGAGGAA
GTCGAACTCG ATTACTACCG CTTGCAGAAA ATCAGCGAAG GCTCGATTAG CCTCAATGAG
GGCTATGCCA AACCGCTTGA CGGCCCGCGA GAGGTGGGCT CAGGCATGGT GCGTGAGGAG
CCCGTTTCGC TGTCACGCCT GATCGACATC ATCAATCAGC GCTTCGGCGG CGAGTTAAAT
GAGGCAGATC AATTGTTTTT CGATCAGATC ACCGAAGCTG CAAGCCAGAA CGAGTCCCTA
CAGAAGGCAG CAGAGGTCAA CTCTCTGGAC AAGTTCCAGC TCGTATTTCG GCAGGTCCTT
GAGTCACTAT TTATCGAGCG CATGGAGTTG AATGAGGAGC TGTTCACTGA TTACATGGGC
AAGCCAGAGA TGCGGGAGCT GGTGTCCAAG TGGTTGGGTA GCCAAGTTTA CGCTCGTCTT
TCGGATAGGG CACCGCAGAG CTAA
 
Protein sequence
MKRTSEVAFE TAIEAVLLSE GYTRVDVKGF DRERAIFLDE ALAFIRATQG KVWEKLEALH 
GEQTGARVLE SLCKWLDTHG ALATLRHGFK CFGRTLRIAF FRPAHGLNPE LEARYQANRL
GLTRQLHFSP KSEKSLDVVL SVNGIPVVTL ELKNPLSGQT AANAIHQYRH DRDPREPIFE
FTKRALVHFA VDTEEAHMAT RLAGSSTYFL PFNRGMDGGA GNPPDREGRN YKTAYLWEEV
LQRDSLLDLL ARFLHLDVEE KTTNVGKKVR KESLIFPRYH QLQAVRRMVA AAASEGAGHN
YLVEHSAGSG KSNTIAWLAH RLSSLHNERD ERLFDSVVVI TDRVVLDRQL QNTIYQFDHR
QGVVQKIDED SRQLAEALEA GVPIIITTLQ KFPFVSGQLA KLSEERGEGS KSHLPTRKYA
VIIDEAHSSQ SGETASELKG VLGGAELRRK AQEMAEEEGE VELERLFRSM AKRGHQPNMS
FFAFTATPKH KTLAIFGRGG EPFHRYTMRQ AIEEGFIEDV LKSYVTYKTY YKLIKKAEDD
PNVERKKAAK ALARFMRLHP HNIGQKTEVM VEHFQHFTRH KIGGHAKAMV VTGSRLEAVR
YKQEFDRYIQ EKGYPIKSLV AFSGTVEDDK IPEKSYTEVE MNGGLKEKEL PDTFAKPEFR
VLLVAEKYQT GFDQPLLHTM YVDKRLAGIQ AVQTLSRLNR THPLKDDTFV LDFVNDPDEI
QEAFRQYYDG SVMGEQVDPD KLYEVKAELD ASGIYLQTEI VEFAHVFFAP KRRQSPGDHK
LMNATLDLAV ARFVRLQNTE EDEAELWRSK LQAFRNLYGF LSQVIPYQDS DLEKLFTYLR
HLALKLPKRK SGPGYQFDEE VELDYYRLQK ISEGSISLNE GYAKPLDGPR EVGSGMVREE
PVSLSRLIDI INQRFGGELN EADQLFFDQI TEAASQNESL QKAAEVNSLD KFQLVFRQVL
ESLFIERMEL NEELFTDYMG KPEMRELVSK WLGSQVYARL SDRAPQS