Gene Daro_3727 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3727 
Symbol 
ID3568160 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4005579 
End bp4007297 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content61% 
IMG OID637682200 
ProductTPR repeat-containing protein 
Protein accessionYP_286926 
Protein GI71909339 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones55 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.239991 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACCGA AGTTCATCGC TGCCGCCATC TCCCTCGCCC TGGGCCTCAC CCAAGGCGGA 
TACGGTCTGG CCGCCGACGA GCCCCCCGCC AAGCCTATCG CCAAACGCAT GGCGGCCCGC
GTAGCATCCT CGGAGGATCT GCTGGCACGC ACCGTTTTCC AGGCACTGGT CGGCGAATTC
GCCCTGCAGC GCGGTGATGC CAGGCTCGGC TCCGATGCCT GGACCGATCT CGCCCAACGT
ACCCGCGACC CCAAGGTGAT CGCCCGGGCC ACCGAAGTTG CCGGGTTTGC CCGCCAGTAT
GACCGCGCAC TCGAACTCTC GAAACTGTGG ATAGAAATCG AACCCGATTC CGCACAAGCC
AGACAAACGC AATCTTCCCT GCTGGTCATG GCCAACCGCC TTGACGACCT GGCGCCGCAA
CTGACTGCGC TACTTGAACA GGACAAGCCA AATCTGGCCA GCAACCTGCT GCATCTGAAT
CGCATGCTCT CCCGCATCGG CGACAAGAAA TCCGCACAAG CCCTGGTCGA TCGTATTGCA
ACCCCCTACG ACAGCCTGCC CGAGGCTCAT TTTGCGATGG CCCAGGCGGC ATCGAACGCC
AACGACAACC TGCGCGCCCT GAACGAAACA GAAAAGGCCT TGCAACTCCG CCCCGACTGG
GAAATGGCCG CGCTGGCTCG CGCCCAACTA CAAGCGCGCC AATCCGGCAA GACGGCGATA
GACAGCCTCG CAGACTTTGT CAGCCACAAC GCTACCGCCC GCGATGCCCG CCTGACCCTG
GCCCGTCTGC TGATCAGCGA AAAACGCTAC AGCGAAGCGC GCCAGCATTT TGATCGCCTG
ATCAAGGACA ACCCGGACAG CCCGGAAGTC ATTTACCCGG TTGCCATGCT CGCCCTGCAG
CAGGGTGACG CAAACACCGG CCGTAAACAG CTCGAACACT TGCTGACCAC CGACTTCCCG
GACAAAAACA CCGTGCACTT CTTCCTTGGC CAACTCGACC AGGAACAGAA AAAACCGGAA
ATGGCACTCG AGCACTTCCG CCAGGTCACT GGCGGCGAGC AATACATCGC GGCCCGCTCG
CGAGCCGCCC AGATCCTGCT GCAACAGGGC AAAAGCGAAG AGGCCCGCGA ACTCCTGCAC
AACACGCGCG GCGGCACGGT CGCCGAACGC ACCCAGTTAA CCCTGGCTGA ATCACAGCTA
TTGCGCGAAG CAGGCCGCCA TAACGACGCA TACATCGTTC TCGATAGTGC CCTTTCCGTG
CAACCCGACA ATACCGAGCT GCTCTACGAA GCAGGCCTGA CGGCAGAACG CATCGGCAAA
CCGGAGCTGC TTGAAACCCA CCTCAAACAA CTGCTCGCCA TCAAACCCGA CCATGCCCAC
GCCCTGAATG CACTGGGCTA CTCCTGGGCC GAACGCAACA TTCGCCTGCC GGAAGCCCAT
GACCTGATTG CCAAGGCGCT CAGTCTGGCA CCGGAAGATC CCTTCATCAT GGACAGCATG
GGCTGGGTGC TCTATCGCCA GGGCAAGCTC ACCGAAGCCC TGCAAACACT GGAGCAGGCC
TACAAGATCA AGGCCGACCC GGAAATTGCC GCCCACCTCG GCGAGGTTCT GTGGGCGCTG
GATCGCAAGG ACGAAGCCCG CAGTCTCCTC AAAGCGGCTG CCAAGGCCAA TCCAGACAAC
GAAGTGCTGA TCGGCGCCGT CAAGAAACTA CTGCCTTGA
 
Protein sequence
MSPKFIAAAI SLALGLTQGG YGLAADEPPA KPIAKRMAAR VASSEDLLAR TVFQALVGEF 
ALQRGDARLG SDAWTDLAQR TRDPKVIARA TEVAGFARQY DRALELSKLW IEIEPDSAQA
RQTQSSLLVM ANRLDDLAPQ LTALLEQDKP NLASNLLHLN RMLSRIGDKK SAQALVDRIA
TPYDSLPEAH FAMAQAASNA NDNLRALNET EKALQLRPDW EMAALARAQL QARQSGKTAI
DSLADFVSHN ATARDARLTL ARLLISEKRY SEARQHFDRL IKDNPDSPEV IYPVAMLALQ
QGDANTGRKQ LEHLLTTDFP DKNTVHFFLG QLDQEQKKPE MALEHFRQVT GGEQYIAARS
RAAQILLQQG KSEEARELLH NTRGGTVAER TQLTLAESQL LREAGRHNDA YIVLDSALSV
QPDNTELLYE AGLTAERIGK PELLETHLKQ LLAIKPDHAH ALNALGYSWA ERNIRLPEAH
DLIAKALSLA PEDPFIMDSM GWVLYRQGKL TEALQTLEQA YKIKADPEIA AHLGEVLWAL
DRKDEARSLL KAAAKANPDN EVLIGAVKKL LP