Gene Rcas_0007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0007 
Symbol 
ID5537464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4718 
End bp6460 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content64% 
IMG OID640892172 
Productphosphotransferase domain-containing protein 
Protein accessionYP_001430164 
Protein GI156740035 
COG category[L] Replication, recombination and repair 
COG ID[COG1796] DNA polymerase IV (family X) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0296814 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000565114 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGACAGCAT ACCCGACTAA TCACGATATT GCGGAAGTTT TCAGCGCCAT TGCCGATCTG 
ATGGAGATTC TGGACGAGGA TCGGTTTCGC GTGCAGGCGT ATCGGCGCGC CGGCGATGTG
ATTCGTCATC TGCCGGCGCC GCTGGCGACC TACCGCGCTC GCGGTGAATT AGAGCAGATC
CCTGGCGTTG GCAAAGCCAT CGCCGAAAAG ATCGGCGAAC TCCTCGATAC CGGGGAGTTG
CCGTACTACA ACCGGCTCCG GGAGAAGGTT CCTCCCGGCG TGCGTGAATT GCTGCGCGTT
CCTGGCATCG GTCCGCGCAC TGCCGGTCGC CTCTACCGCG AACTCGGGAT CACCAGCCTG
GCAGAGTTGA AGGTTGCTGC CGAAGCCGGG CGCCTGGCGG CCCTTAAGGG GTTTGGTGCG
AAAACCATTG ACAGCATTCT GCAAGGCATC AGCGCGGCGG AGCGGCAGGA GCGTCGGATG
CTGCTGGCGC ACGCGATCGA TAGTGCCGAA GCGTTGATCA ACGCTCTGCG CGCCGCTGTG
CCGGCGCTGA GTCAGGCGGC GTATGCCGGC AGCCTGCGCC GTGGCCGCCC CACTGTTGGC
GATCTCGACA TTCTGGCGGC TGCCGATGAT GCGCCCGCTG TTGTGCGCGC CTTTACGATG
CTGCCGCTCG TGGCACGGGT CGAGTCGGCA GGGGACGAAA AAGCCAGCAT TCTGCTCCAT
AATGGCATGC AGGCGGACCT GATCGCGGTT CCGCCGGGCA TGTGGGGGTC GGCGTTGCAG
CACTTTACCG GCAGTAAAGC GCACAATATC CACTTTCGTG AGCTGGCGCT GGCGCAGGGA
TTGAGTTTCA GCGAGCATGG CTTCCGTCGT GCCGATGGCA CGCTGCTGAC ATGCGCCACC
GAGGAAGAGG TGTACGCTGC CATCGGTCTG CCCTGGATTC CACCGGAATT GCGCGAGGAC
GAGGGGGAGT TCGAGGCGGC GCGCGCCGGC ACGTTGCCGT GCCTGGTCGA ACTCAGCGAC
ATCCGCGCCG ATCTCCATCT GCACAGCACC TGGAGCGACG GACGCGCCGA TATTCGCACC
ATGGCAGAAG CCGCGCGCAC CCGTGGCTAT TCCCATATCG CTATCACCGA CCATAGCGCG
TATATGGGGA TGACTCACGG ATTGGATGCA GAGCGCCTGC GCGCACAGCG CCAGGAGATC
GCAGCATTGA ATGCCGAATA TGCGGCGCGC GGTATTCCGT TTCGCATCCT GCACGGCGTC
GAGGTCGATA TCACTCCTGA AGGAAATCTG GCATTGCCCG ACGATGTGCT GGCGGAACTC
GATATTGTTG TCGCTTCGGC ACATATTCAG TTGCGTCAGT CGCCCGAAGC AGCGACCGAG
CGGTTGATCC GCGCCGTGCG CAATCCGCAC GTCGATATCA TCGGGCATCC GGTGGGGCGG
ATGCTGGGAT CACGCGACGG CGCGCCGGTC GATATCGATG CGCTGGCGTA TGCCGCTGCC
GAGCATCGCG TGCTGCTGGA GGTCAACAGC GGACCGCACC GCCTCGACCT GGATGGCGCC
GCAGTGCGGC GCGCGCTGGC GTCTGGCGCT GTCATTACCA TCAACAGCGA TGCGCACCAT
CCCGACAATC TGGCGTGGAT GCGGTTCGGC GTCGTCACGG CTCGGCGCGG TTGGGCTGGT
GCGGCGCAGG TGGCGAACAC CTGGAGTGAT GAAGCGCTTC AGGAGTGGTT GAGTCGACGT
TGA
 
Protein sequence
MTAYPTNHDI AEVFSAIADL MEILDEDRFR VQAYRRAGDV IRHLPAPLAT YRARGELEQI 
PGVGKAIAEK IGELLDTGEL PYYNRLREKV PPGVRELLRV PGIGPRTAGR LYRELGITSL
AELKVAAEAG RLAALKGFGA KTIDSILQGI SAAERQERRM LLAHAIDSAE ALINALRAAV
PALSQAAYAG SLRRGRPTVG DLDILAAADD APAVVRAFTM LPLVARVESA GDEKASILLH
NGMQADLIAV PPGMWGSALQ HFTGSKAHNI HFRELALAQG LSFSEHGFRR ADGTLLTCAT
EEEVYAAIGL PWIPPELRED EGEFEAARAG TLPCLVELSD IRADLHLHST WSDGRADIRT
MAEAARTRGY SHIAITDHSA YMGMTHGLDA ERLRAQRQEI AALNAEYAAR GIPFRILHGV
EVDITPEGNL ALPDDVLAEL DIVVASAHIQ LRQSPEAATE RLIRAVRNPH VDIIGHPVGR
MLGSRDGAPV DIDALAYAAA EHRVLLEVNS GPHRLDLDGA AVRRALASGA VITINSDAHH
PDNLAWMRFG VVTARRGWAG AAQVANTWSD EALQEWLSRR