Gene Daro_1557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1557 
Symbol 
ID3568642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp1670119 
End bp1671408 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content55% 
IMG OID637680025 
Producthypothetical protein 
Protein accessionYP_284776 
Protein GI71907189 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value3.4261e-20 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.109896 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCTC ACAAAGCAGA AACCACTGGC TCCCTGAAAC TGACCTGCCT GTCAGTGATG 
CTGGCTCTTG CCGGCCTTTC TTCTGCAACC GCCCAGGAAG CAGTGGATAC CGAGAAGCTG
TTCAAGGAAG GCATTTTCCT GCGCGAGCAA GGGCAAGTAT TCAGCTCCAT CGAAGCCCTC
GAAACAGTTC TGAGCAACAA CCCTGCACTC AATCGTGCCC GTCTTGAACT GGCCGTCGCC
TACTATCGTG CGCTGAACTA CGACCAAGCC AACCAGCAGG CGCAAAAGGT TCTTGACGAC
CCGAAGACCC CGGAAAACGT TCGTCTCGCC GTGCTGGCCT TCCTTGCCCA GATCAAGCGT
GACCAAGTTG CACTGGTCGC CAAGCCGCAT ACGTTTGAAG GCTCCATCTC TCTTGGGGCT
CAATACGACT CCAACGTCAA CGTGGGTCCC GGCGGCGCCA TTCTTCCCGG TGGTCTGATT
CTCGACCCGG GTTCTGTTCC CAAGCACGAT TGGGCTTCCG TTATTCAGGC TGGCGTTACT
CATACCTACA ACTCTCCGAG CGTTGTGCGA CTCGGTGAAA CAGCAACCCG CTTCCTCTGG
CAGACCAGCG CTGGTCTTTA TCAGAAGAAC TACTTCAGCG TGACTGATTT CAATCTGACC
GCACTCAGCC TATCGACCGG TCCTGTCCTT ATTGCCCCGG ACAAGTGGCG CGCCAAACTC
AACCTGCAGG TTGATGGCCT GTGGCTGGGC GGCAACTTCC TTGGCGTCTA CACCTCGCTT
TCTCCGACGG TTACCTTGCA ATTCAAGAAT GGCGAGTTGA CCTGGGACGC TTTGGTGCTG
AACAAGGCTT TTGATCGCAC TATCGATGTC GGCCGCGACA GCAACTACTA CTCTACCGGC
GTTTCCTATG GCCACCTGTT CCTGCAAGGC AAGCTGGCGC TTCAAGGCGG CCTGCACGTC
TTCATGGAAG ATGCCTCGGC CAGCCGTTAC AGCAATGATG GTTGGGAAGC CTTCGTTGGC
GCCAATGTAG TTGCTTGGCA AAACGGTAAT GTTTATGGTC GCTACAGCTA CAAAGACACC
AAGTTTGATG GTGTTGAGCC GGTATTTGCC CTCGCCCGCG ACGAATACGA AAAACGCTAC
GAAGTCGGTT TCGGCCACAA CTTCAAGGAA GGCTTCGCGA AGGATTGGCG TCTGTCTGGC
AGCTGGCAGA AAACGGAGAA CAACTCCAAT GTCAGCATCT ACACTTACAG CCGTCAAATC
GCTGGCGTTT CGATCGGTCG CTCGTTCTGA
 
Protein sequence
MNAHKAETTG SLKLTCLSVM LALAGLSSAT AQEAVDTEKL FKEGIFLREQ GQVFSSIEAL 
ETVLSNNPAL NRARLELAVA YYRALNYDQA NQQAQKVLDD PKTPENVRLA VLAFLAQIKR
DQVALVAKPH TFEGSISLGA QYDSNVNVGP GGAILPGGLI LDPGSVPKHD WASVIQAGVT
HTYNSPSVVR LGETATRFLW QTSAGLYQKN YFSVTDFNLT ALSLSTGPVL IAPDKWRAKL
NLQVDGLWLG GNFLGVYTSL SPTVTLQFKN GELTWDALVL NKAFDRTIDV GRDSNYYSTG
VSYGHLFLQG KLALQGGLHV FMEDASASRY SNDGWEAFVG ANVVAWQNGN VYGRYSYKDT
KFDGVEPVFA LARDEYEKRY EVGFGHNFKE GFAKDWRLSG SWQKTENNSN VSIYTYSRQI
AGVSIGRSF