Gene Daro_3036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3036 
Symbol 
ID3568240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3278079 
End bp3282713 
Gene Length4635 bp 
Protein Length1544 aa 
Translation table11 
GC content51% 
IMG OID637681507 
Productputative Tfp pilus assembly protein tip-associated adhesin PilY1 
Protein accessionYP_286236 
Protein GI71908649 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3419] Tfp pilus assembly protein, tip-associated adhesin PilY1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.583565 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.438129 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCTT CTGCGCACCA TAACTCGGTT ATTCGTCAGG CCCTGAGTCT GCTAATCGTT 
TTTCAGCTTG GATTTTCTTC CCCATCGCAT GCTGCTTCGA TTGCGTTGGC CAGTGCACCA
CTGGCCAACT CGACCACCAC CACTGTGTTG CCCAACCTGA TGTTCCTAAT GGATAACTCA
GGCAGTATGA GTCAGGACTT TACGCCTGAC TACATGATGG AATACAACTG GCGTAACAGC
CCATGGACGT CGAATGGCTG GAATGCTCCA GACCCTGTTC AGAAGAACTG TCGTGATAGT
GCGGATGACG ATGGGTCGGT AACTACTGCG TTGGCAGATC TGGATTTGTG CGTCGTCGGT
GACGTGCCCT ATATGACAAG CTCCATTAAT TCGCAGTATT ACAACCCGGC TATCAGATAT
TTGCCGGGAG TGTTATCCGA TGGTGTCAGC AAGCCGAGCC AGACTGACCC GACCAATGTT
TTGCTGGATG GCTATGGCAA GCATAACCAG ACACAACTTG GTGTTGCCGG CACGGCAATC
GATTTGACCA CGAACTATCC TGACCGAGTC TGGTGTACCA AGAACAATCC AACTGCGGCC
GAATTGGCGG ATACCTCGGT TTGTCGGAAG AACAGCGACT ACCTTTATCC GAATGCAACG
TACAAATACG GGCGAAATAA TGATGGCACG ACGGCCAACA AGGATGTGCT CGGGGTCATG
GGGGCTCCTT ATTACTATCA AGTTGTCGTC AGCGAGTATT GTAAAGAGGC CGAGTTGCGG
AACTGCACTA CTTCCTCGGT AGCCACGGGG GACTACATCT TCCCTGCCAA ATCACGTTGG
TGTTCTGATA CGGCATTGAC GACCTGTCAA TCTACTAAAA CCTCAACCTA CAAATACCCA
CGTTATGTGG GAGCGTCAAC GGTGGCTGTG GCGGCAAGTG GGATAATCCA GGCTACTAAC
ACGACGCCTC GCACCATTAG TAGCATCACC GTAAATGGTG TGGAAATTCT TGGCGCACTG
GTCACTGGCA CTAGTCAGAT TGACCTAGCG TCCAAGGTGG CGACTCAGAT CAACGCCTAT
GCCTCCAATC CGGAGTATTC GGCAGCTCTT TATGGCGATA ATACATACGT AAAAATTACT
TCGACCGCAG CGGCTGGTGC AAGCGCAAAT GGTACGGTGG TTATTACCGG CAGAGGTATT
GTTACCAGCA ACGTTTCTGG CGGAGTGACT GGCTTCTCTG CTGCGCCATA TTCTTTCGCA
CGTACCGATA TCGTTCCGGC GACAACCAGC TATCCCAAGG CGACCTCACG CACCGACTGT
ACGGGTGCTA CCTGTACGTA TGCTGATGAG GTGACCAATT TTGCCAACTG GTATGCCTAC
TACCGGACTC GGATGCAGTC TATGAAGAGT TCTGTCAGTC TTGCCTTCCG GCCAATTGGT
TCGAATTACC GTGTTGGTTT TATGAATATT TGTAAAGGGA GCTATCTACC GGTCGCGCCT
TTCGATAATT CGGATTCCCT GATTGGTGCT GTGGCTGCGA GTGGTACTTT CCGCTTCAAC
AGCTTTACAG CAGGCGTCAT GCAAACGGTT ACTAGTATCA AAGTAAATGG TATCGAGATT
CTTGGGGCTA CAGTCACTTC CAATGTCAGT CGAGCCGATT TGGCGACGAA GTTGGCTGCT
CAGATCAACG CCTTTTTATC ATCGCCTGAG TACACTGCAG TGGCGAATTC AGGTGGTAGC
AATGGCTTGG TAACGCTGTC AGGAAGTATC GGTGATGGAG CTGGTTCCAA CGGTACAATC
GTGGTAACTG GCGGCCCTTC GCTGAGTACC AATGCCGGGG TAAGTGGTGG CGTGACTGGC
TCTGGGCAGA AGTCTAAATG GTATGAAACC TTGTTTGACC AGACGGCGGT AGGTTGCGGC
ACGCCGCTGC GTTCGGCGCT TGCTACCACT GGCCGCATTT TTGCCGGTAA GGAAATGTCG
TCTGCAGGTT CGACCTCTGG TTCGACGATT GACCCTGTTC AGTATTCTTG CCAACAGAAC
TTTACTTTGT TGACGACAGA CGGATACTGG AACGGTGCTG GCGGTACAGA TCTTGATGGC
GTCGCCATGG GTAATCTGGA TGGTGGTACA ACGCCTCGCC CGATGTTCGA GGGCAACACA
GCTACTGGCA CCCTTGCCGA CGTCGCAAAA TACTACTACG ACACCGATTT ACGAACGAGT
GGCCTGAGTA ACTGCACAGG TTCGCTCGGT ATTGATGTCT GTGAAAACAA TGTCTTTGTC
AGCAGTACTG ACAACAACCT CAAGCAGCAC ATGACGACCT TTACGCTTGG TCTTGGTGTG
GATGGTACGT TGTCATATGT TTCAGACTAC AAAAATGCGA CAACAGGTGA TTTTTACAAT
TTGAAGGAGG GTTTGGGGTC GCCTGTGGTT AATTGGCCCG TTCCTGTTGC GGATAACGAA
ACAGCAGTGG ATGACTTGTG GCATGCGGCC GTCAATGGTC AGGGAACTTA CTTTAGTGCC
AAAGACCCTG CGCAGTTGGC CTATGGCCTT TCAACGGCGC TCAATCAAAT TGGCTCCAAG
GTTGGTGCGA CGTCAGCGGC GGCTACTAGC ACTCTTAATC CGGTTGCCGG AAACAATTTC
GCCTATGTCG CTAGCTATAG CTCGGTGAAG TGGACCGGTA ATCTGGAGGC GCGAACGATC
AATACAACGA CCGGCGTCGT CAGCGAAACT GCCGCATGGT GCGTACAGAG CATCAGTGCT
GGAACGTGCG CCTTGCCAAG TACGATTGTG ACGGAGGATA CCGGAAGTAG TACGGTCACT
TACTGTGTAA CTTCCGGGGC AACTGCTGCC TCTTGTACTG ATGGCATTCT GGATGGAACA
AACTGTAAGG TTCAATTGCC TACCAGTTGT GTCGGTACGA TGAACAGCAA GGTCGATAAG
TCCAGCGACT CAAGGACTAT CTATAAAGCC AACGCATCCG GAGCCTTGGA AAGTTTTATT
TACGCCAATC TCGACTCCTC GTTATTTACG GGTACAGGGC TGAGTCAATG GTCAGTATTG
ACCGCCTCTC AGAAAACCGT GGCTGCGGGG GAGAACCTGG TTAATTTCCT GCGTGGCCAG
ACTGGATATG AGGATAGAAC TAGTAACCCT GTTGATAATC GGCTGTATCG GATGCGCGAG
GCAACGATGG GCGATGCTCT GGAGTCCCAG CCGTTCTTTA TCAGCAAGCC GGTATTTAGT
TACGCCGATG CGGGCTATGC CAAGTACAAA ACGGATTTTG CTTCACGAGC TGGCTCCGTC
TATATGGGGA CCAATGATGG GATGTTGCAC GCTTTTGCTG CCGATACCGG TGTTGAGCGC
TGGGCTTATG TTCCAACGGT TGTGATTCCC AACCTTTGGA AACTTGCCGA CAAAAATTAC
GCTACTGGGC ACGCAAACTA TGTGAACGGT AGCCCGGTTA TTTCGGATAT TTGTACGGCT
AATTGCAGTT GCGATGATGC TTGTGTTTCT GGCGGTGGGA CGGCCCCGGT GTGGAAGACC
ATCTTGGTTG GTGGGTTAAA TGCAGGGGGG AGGAGTTTCT ACGCCTTGGA TATAACAAAT
CCTGCGTCTC CGTCGTTGTT GTGGGAGGTT TCGTCAAGTA CGGCAGGGTT TACCAATCTG
GGTTATAGCT TTGGTCACCC TATTATTACC AAGAAGTCCG ATGGAACTTG GGTTGTTTTG
ATAACGTCGG GGTATAACAA TACGAGCCCC GGTGATGGCA AGGGTTATTT ATATGTGCTT
AATGCAGGCA CTGGCGCACT TATCGCTGCA ATAGGAACTA CGGTTGGTGA TACAACGACA
CCGAGTGGCT TGGCAAGGGT CTCGGCTTGG AATGACTATG GTGGCGTTAA TAACACAGCC
GGCTATGTTT ATGGTGGTGA TTTGCTTGGC AACCTCTGGC GTTTTGATAT CAATGCTGAA
AGCGTGAACA AGTTTGCGAC ATTGCTTGAT TCGTCTGGTA AAGCACAACC AGTGATGACC
AGTCCGACTT TGGGGTTGAT CAGCGGAAAG AGAGTGGTCT TCGTCGGGAC TGGTAAATAT
TTGGAAGTAT CAGATCTGAC AAATACCGAT TCGCAAACTA TTTATGCAAT TAGCGACGAT
AACAGCGGCA CGACTTTGGT CAATGCTAGG ACTGTGCTGG TTGAGCAAAC TTTGACGCGA
AACGGAACTT CGCGGACTGC GAGTAATAAT CCGGTGAATT TTGGCTCGGG CCGTGGTTGG
TATGTCGACC TGAAGGATAC AAGTCTGTCC CCATCGAATG TTGGTGAGCG AGTAAACATT
GATATGACCT TGGTTCAAGG GACTTTGATT GCTGCAAGCA TAGTTCCGTC CAACACCGTA
TGTTCGCCTG GTGGCTATGG CTGGCTTAAC TACTTTAATT ACGAGACTGG TGGATATGTG
CAGTCGGATG CCCTAGCGTC GACTTACTTC AGTTCCCCGA TTGTCGGTGT CAACCTGATA
TACATTCAGG GCAAGCCGAT TGTTGAGGTG GTTACGGCTA ACAAGCCAAC GCCCGAAATT
ACGGAAGTGC CAATCACCGG TAAGGGTAGT AACTTTGCCG GTAAGCGGGT TATGTGGCGC
GAGCTGGTTC AGTAG
 
Protein sequence
MKASAHHNSV IRQALSLLIV FQLGFSSPSH AASIALASAP LANSTTTTVL PNLMFLMDNS 
GSMSQDFTPD YMMEYNWRNS PWTSNGWNAP DPVQKNCRDS ADDDGSVTTA LADLDLCVVG
DVPYMTSSIN SQYYNPAIRY LPGVLSDGVS KPSQTDPTNV LLDGYGKHNQ TQLGVAGTAI
DLTTNYPDRV WCTKNNPTAA ELADTSVCRK NSDYLYPNAT YKYGRNNDGT TANKDVLGVM
GAPYYYQVVV SEYCKEAELR NCTTSSVATG DYIFPAKSRW CSDTALTTCQ STKTSTYKYP
RYVGASTVAV AASGIIQATN TTPRTISSIT VNGVEILGAL VTGTSQIDLA SKVATQINAY
ASNPEYSAAL YGDNTYVKIT STAAAGASAN GTVVITGRGI VTSNVSGGVT GFSAAPYSFA
RTDIVPATTS YPKATSRTDC TGATCTYADE VTNFANWYAY YRTRMQSMKS SVSLAFRPIG
SNYRVGFMNI CKGSYLPVAP FDNSDSLIGA VAASGTFRFN SFTAGVMQTV TSIKVNGIEI
LGATVTSNVS RADLATKLAA QINAFLSSPE YTAVANSGGS NGLVTLSGSI GDGAGSNGTI
VVTGGPSLST NAGVSGGVTG SGQKSKWYET LFDQTAVGCG TPLRSALATT GRIFAGKEMS
SAGSTSGSTI DPVQYSCQQN FTLLTTDGYW NGAGGTDLDG VAMGNLDGGT TPRPMFEGNT
ATGTLADVAK YYYDTDLRTS GLSNCTGSLG IDVCENNVFV SSTDNNLKQH MTTFTLGLGV
DGTLSYVSDY KNATTGDFYN LKEGLGSPVV NWPVPVADNE TAVDDLWHAA VNGQGTYFSA
KDPAQLAYGL STALNQIGSK VGATSAAATS TLNPVAGNNF AYVASYSSVK WTGNLEARTI
NTTTGVVSET AAWCVQSISA GTCALPSTIV TEDTGSSTVT YCVTSGATAA SCTDGILDGT
NCKVQLPTSC VGTMNSKVDK SSDSRTIYKA NASGALESFI YANLDSSLFT GTGLSQWSVL
TASQKTVAAG ENLVNFLRGQ TGYEDRTSNP VDNRLYRMRE ATMGDALESQ PFFISKPVFS
YADAGYAKYK TDFASRAGSV YMGTNDGMLH AFAADTGVER WAYVPTVVIP NLWKLADKNY
ATGHANYVNG SPVISDICTA NCSCDDACVS GGGTAPVWKT ILVGGLNAGG RSFYALDITN
PASPSLLWEV SSSTAGFTNL GYSFGHPIIT KKSDGTWVVL ITSGYNNTSP GDGKGYLYVL
NAGTGALIAA IGTTVGDTTT PSGLARVSAW NDYGGVNNTA GYVYGGDLLG NLWRFDINAE
SVNKFATLLD SSGKAQPVMT SPTLGLISGK RVVFVGTGKY LEVSDLTNTD SQTIYAISDD
NSGTTLVNAR TVLVEQTLTR NGTSRTASNN PVNFGSGRGW YVDLKDTSLS PSNVGERVNI
DMTLVQGTLI AASIVPSNTV CSPGGYGWLN YFNYETGGYV QSDALASTYF SSPIVGVNLI
YIQGKPIVEV VTANKPTPEI TEVPITGKGS NFAGKRVMWR ELVQ