Gene Daro_4140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_4140 
Symbol 
ID3566640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4438766 
End bp4439926 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content60% 
IMG OID637682612 
Producthypothetical protein 
Protein accessionYP_287336 
Protein GI71909749 
COG category[S] Function unknown 
COG ID[COG1565] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones76 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAATC TGCCGATTCC CGCTCCGGAG GCCCTGGCGC ACAGCCAGCG CTTGCACCAG 
GCCATTGCGG ACGAAATTGC CGCAGCCGAC GGCTGGGTGT CGTTTGCCCG CTTCATGGAG
CTGGTGCTCT ACGCACCTGG TCTCGGCTAT TACACAGCCG GGGCACGAAA GTTCGGGGCG
GCTGGAGATT TTGTCACATC ACCGGAAATG ACCCCCTTGT TCGGACAGGT ATTGACCCGT
CAGGTGGCTC AGGTCATGGC TGAATCAGCG CCGGTAGTGC TGGAGGTCGG CGCTGGCTCC
GGGCGCCTTG CCGCCGACCT GTTGCTTGCT CTTGAGCGGA TGGGGGAGCT GCCCGAACAC
TATTTCATTC TCGATCTTTC CGCCGACCTG CGACAACGCC AGAGACAGAC GATCGCCGAG
GCTGCACCGC ATCTGCTGAG CCGGGTCGAA TGGCTGGATC GATTGCCCGA GACTTTCTCC
GGGGTGGTCG TGGCAAACGA GTTGCTCGAC GCGATGCCGG CGAATATCGT TGCCTGGCGC
GAGAATGGCA TTTTCGACCG GGGGGTTGTC GTTGATGAAG CCGGGAGTTT CATGTGGAAC
GAACGTCCGG CCACGGGAAC CCTGCTCGCA GCGGCCGAGG AAATTGGCGC GCAATGTAGC
CTGCCACCCG GTTTCGAGAG TGAGATCAGT CTGACTGTCC GGGCCTGGCT CTCGGAATGG
GGGCGTCGCC TGGAAAAAGG CGCCCTGCTA CTGATTGACT ATGGTTTTCC CCGCCGCGAG
TTCTATCATC AGCAACGTGG TCGTGGGACG TTGATGTGTC ATTACCGCCA CCATGCACAT
CCCGATCCGT TTTACCTGCC AGGCTTGCAG GATGTCACGG TCCACGTCGA TTTCACCGCA
GTCATTGCGG CGGCGCATGC GGCCGGGCTG GATCTGCTCG GCTATACAAA CCAGGGGCAG
TTCCTGCTCA ATTGTGGAAT ATTGGATCAG TTGGCTGAAA TTCCCAATGG AACTCCAGAA
TATATCCGGG CGGCCGGGGC AGTTAACATG CTGCTGATGC CGCACGAGAT GGGCGAACTA
TTCAAGGTGA TCGCAGTTGG TCGCGGCATT GATGAGCCGC TATGTGGTTT TGCCAACGGT
GATCAGGGCT GGCGCCTGTA A
 
Protein sequence
MNNLPIPAPE ALAHSQRLHQ AIADEIAAAD GWVSFARFME LVLYAPGLGY YTAGARKFGA 
AGDFVTSPEM TPLFGQVLTR QVAQVMAESA PVVLEVGAGS GRLAADLLLA LERMGELPEH
YFILDLSADL RQRQRQTIAE AAPHLLSRVE WLDRLPETFS GVVVANELLD AMPANIVAWR
ENGIFDRGVV VDEAGSFMWN ERPATGTLLA AAEEIGAQCS LPPGFESEIS LTVRAWLSEW
GRRLEKGALL LIDYGFPRRE FYHQQRGRGT LMCHYRHHAH PDPFYLPGLQ DVTVHVDFTA
VIAAAHAAGL DLLGYTNQGQ FLLNCGILDQ LAEIPNGTPE YIRAAGAVNM LLMPHEMGEL
FKVIAVGRGI DEPLCGFANG DQGWRL