Gene Dgeo_2280 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2280 
Symbol 
ID4059229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp2402370 
End bp2403545 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content66% 
IMG OID641231330 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_605743 
Protein GI94986379 
COG category[S] Function unknown 
COG ID[COG4102] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000359779 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCTCG ATCGCCGTGA CTTCCTGAAA TACTCCGCCC TCGCCGTCGC CGCAACCAGC 
GGCATGCCGG GCTTTCTCGC CCGTGCCGCC ACCCAGGCGA GCGGCACCCG GACGCTGGTC
GTGATCCAGC TCACCGGGGG CAATGACGGA CTCAACACTC TGATTCCCTA CTCCAACGGC
GCCTATTACG CCGCGCGGCC CAACATCGCC ATTCCCAAAA AGGACGTGCT GACCCTCACC
CCCGACCTCG GCATGCATCC TGCGCTCAAG CCGCTGATGC GCCTGTGGGA TGCTGGGCAG
CTCGCCTGGA TGGAGAATGT CGGCTACCCC AACCCCAACC GCAGCCATTT TGCGAGCATG
GCGATCTGGC ACACCGCCGA CCCCATGCAG GCGCAGGCAG AGGGCTGGAT TGGTCGCATC
GCGGAAAAGA TCGGTGATCC CTTTTGCGCG TCGAATCTGG GCAGCGTGAC ACCGCTGGCC
CTGCAGGCAG CTGACTTTAG CCTGCCCAGC ATCGACAGCG TGGACAACTT TCAGGTGAAG
CTCCCGGCAG GGCTAGACGG TGCCTTCCAG GCCCTGCTGA ACACCGCGCG CAGCGGCGAG
GCGGCCTACC TCCAGCGCGC CACCCGGCAG ATGCTCGCCA ACACGCAGAA GGTGCAGCAA
AACGTCTCGA AGTACCGCCC AGGTGCTCAG TATCCTGAGG GCCGGTTCGC CGCGCAGTTG
CAGGACGCGG CCCGGCTGAT TGCGGCGGGA ACTGGACAGC GGGTGCTGTA CGTGACCCTG
GGCAACTTTG ACACCCACGC CGGACAGCGC GCTGAACAGG ACGAACTCCT GGGGCAGCTC
GCCGCGGGCC TCGCGGCGTT CCAGGCCGAT CTGGAGGCGC AGGGCCTCGC AGAGCGGGTG
ATGGTGATGG GCTTTTCCGA GTTCGGGCGG CGAGTGGCCG AGAACGCCAG CGCGGGCACC
GACCACGGCA AGGGCAGCGT GATGTTCGCC CTGGGCCGAG GCGTCAAGGG CGGCATCCAC
GGCGATAGCC CCGACCTGGA AAACCTGTCC GACGGGGACA TCCAGTACAA GCAGGATTTC
CGCGGCGTGT ATGCGGAGGC GCTGACCAAA TGGTTGGGAC TGGACGCCCG GGAGATCCTG
CGAGGCGACT TCCAGGGACC GGGATGGGTG GCCTGA
 
Protein sequence
MPLDRRDFLK YSALAVAATS GMPGFLARAA TQASGTRTLV VIQLTGGNDG LNTLIPYSNG 
AYYAARPNIA IPKKDVLTLT PDLGMHPALK PLMRLWDAGQ LAWMENVGYP NPNRSHFASM
AIWHTADPMQ AQAEGWIGRI AEKIGDPFCA SNLGSVTPLA LQAADFSLPS IDSVDNFQVK
LPAGLDGAFQ ALLNTARSGE AAYLQRATRQ MLANTQKVQQ NVSKYRPGAQ YPEGRFAAQL
QDAARLIAAG TGQRVLYVTL GNFDTHAGQR AEQDELLGQL AAGLAAFQAD LEAQGLAERV
MVMGFSEFGR RVAENASAGT DHGKGSVMFA LGRGVKGGIH GDSPDLENLS DGDIQYKQDF
RGVYAEALTK WLGLDAREIL RGDFQGPGWV A