Gene Daro_2406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2406 
Symbol 
ID3567590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2594358 
End bp2595875 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content55% 
IMG OID637680873 
Producthypothetical protein 
Protein accessionYP_285612 
Protein GI71908025 
COG category 
COG ID 
TIGRFAM ID[TIGR02602] eight transmembrane protein EpsH (proposed exosortase)
[TIGR02914] EpsI family protein
[TIGR03109] exosortase 1 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value0.150586 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCTAG TCACTAGACT CCCTACGGCC TGGCTACGCA CTTTACCGCT ACTGCTGGCT 
CTGATCGGCT GGATCCTCTT CTGGTATTGG GGAACAATGT CGGCGATGCT GGAAATCTGG
GCGCGCTCAG ACACTTACGC CCATGCGTTT ATCGTGCCGC CTATCGCCCT GTGGTTGATC
TGGCGCAAGC GCGATGAACT TCTTGCTATC CAACCAACAG CCTCCGGCTG GCTGGCTCTA
CCGCTGGCAG TAGCAACTTT CGCTTGGCTC CTGGGCCAAT TGACGGCAGT CAATGCACTG
ACCCAGTTTG CACTTGTGAT TACGTTGATT CTTTCCATCA TGTCCCTCCT GGGACTGAGG
ATCAGCCGCC ACATCGCCTT TCCGCTCGCC TTCCTGCTCT TCTCTGTCCC AATCGGTGAT
TTCATGATGC CGAAATTGAT GGACTGGACA GCGGCCTTCA CGGTTACCGC GCTACGCGCG
ACCGGCATTC CGGTTTACCA AGAAGGCAAC CAATTTGTTA TTCCATCCGG CAATTGGTCG
GTAGTCGAAG CATGCAGCGG CATCCGCTAC ATCATCGCGT CAGTAACGGT CGGTACCCTT
TTCGCCTATC TCAATTTTGT CACGTTACGC CGTCGCTTGA TTTTCATTCT GGTTTCGATG
CTGGTTCCAG TGGTCGCCAA CTGGTTGCGC GCCTACATGA TCGTCATGCT CGGCCACTTT
TCCGGCAACA AACTCGCGGC CGGTGTCGAT CACCTGATCT ATGGATGGCT GTTCTTTGGC
GTGGTAATCA TGGCAATGTT CATGATCGGG ACCCGGTGGG CCGAGTCCCC CGCCGCCTCG
CAACCAGCTC TCTTTACCTC GCCCACTACG GCTAAATCAG GCTGGCTAGC CAGCCTGATT
ATCGCCTTGC TTGCAGCAGC TGGACCACTT GCTTTCGCCG CCATCGACAA GTTGGACAAG
GCCAGCGAAC CGAAACTTCC AACACTATCC ATAGAGAATG GCTGGCAGAA CCGGCCCCTT
TTTGCCAGTT GGCAACCTGC CTACGATTCC CCGCCTGCAA AACTTCAGAC TGCGTTCAGT
CAAGATGCTA AAACTGTTGG TCTGTACGTG GCCTATTACC GGAACCAGGA CTATCAACGC
AAGCTGGTCA CATCAACCAA TATGCTGGCC AAGTCCAACG ACACTGTGTG GTCTGTTCTA
TCCCGGGACA CTGCAAATAT AAATATCGAC GGCCTCCCCC CGGTGGTTCG TACAGCCCAG
ATTTTAGGGA AAGACACCAG CCCTCCAAGC AATCTCATTG TTTGGCAGTG GTACTGGGTC
AACGGCAAGC TCGTTACTTC CGAAGCCGAG GCCAAACTAC AGACAGCCCT GTCTCGTCTG
CGTGGCGCCG GTGATGATTC TGCCGTCATC ATGATTTATG CACCAAGCGA ATCAGCTGCC
GACACATTGC CCGCTTTCTC AACCCAGGCT GCGGGAACCA TCAACCAATG GCTTGCGGCG
ACTCGCGACG CTCGATGA
 
Protein sequence
MFLVTRLPTA WLRTLPLLLA LIGWILFWYW GTMSAMLEIW ARSDTYAHAF IVPPIALWLI 
WRKRDELLAI QPTASGWLAL PLAVATFAWL LGQLTAVNAL TQFALVITLI LSIMSLLGLR
ISRHIAFPLA FLLFSVPIGD FMMPKLMDWT AAFTVTALRA TGIPVYQEGN QFVIPSGNWS
VVEACSGIRY IIASVTVGTL FAYLNFVTLR RRLIFILVSM LVPVVANWLR AYMIVMLGHF
SGNKLAAGVD HLIYGWLFFG VVIMAMFMIG TRWAESPAAS QPALFTSPTT AKSGWLASLI
IALLAAAGPL AFAAIDKLDK ASEPKLPTLS IENGWQNRPL FASWQPAYDS PPAKLQTAFS
QDAKTVGLYV AYYRNQDYQR KLVTSTNMLA KSNDTVWSVL SRDTANINID GLPPVVRTAQ
ILGKDTSPPS NLIVWQWYWV NGKLVTSEAE AKLQTALSRL RGAGDDSAVI MIYAPSESAA
DTLPAFSTQA AGTINQWLAA TRDAR