Gene Rcas_1512 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1512 
Symbol 
ID5538988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1931195 
End bp1934113 
Gene Length2919 bp 
Protein Length972 aa 
Translation table11 
GC content62% 
IMG OID640893650 
ProductDNA polymerase I 
Protein accessionYP_001431623 
Protein GI156741494 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.43872 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0397579 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCACACA ACCGTCCGCT GCTGTTGCTG ATCGATGGTC ATGCGTTGGC GTATCGCGCC 
TTCCACGCGC TGGCGGAAGC GGGGTTGCGC TCCTCGACCG GCGAGCCAAC CTATGCGGTC
TACGGGTTCA CCTCGGCGAT GCTCAATGCC ATCGAGGAGT ATCAACCGGA ATATGCCGCA
GTGGCGTTCG ATGTCGGCAA GACTTTCCGC GACGATCTGT ACGCCGAATA CAAGGCGAAC
CGCGCCGAGA CCCCGGCTGA GTTTGAGCAG CAACTGGAAC GCATCAAGCA GGTGCTAGCG
GCGTTCGACA TTCCGATCTA TACCGCCGAA GGATACGAAG CCGATGATGT GATCGGCACG
CTGGCGCGTC AGGCGACTGA ACGCGGCGTT GATGTGCTGA TCCTCACCGG CGATACCGAT
ACGCTGCAAC TGGTCGACGA TCATGTGACC GTGCTCCTCA ACAATCCGTA TCTCCGCGGA
CCGAAAAATA CCACGCGCTA CGGCGTCGCG GAGGTGACGG AACGCTACAA AGGGCTGCGC
CCCCATCAAC TTGCCGACCT GCGCGGCTTG AAGGGCGATC CGTCCGACAA TATCCCCGGT
GTCAAAGGCA TTGGTGAGGC TGGCGCAATT GCGCTCCTCA ATCAGTTTGG CACAATCGAG
AACCTGTACG ACCATCTGGA CGAAGCGCCG AAACGCTATC AGAAGCATCT CGAAGGGCAG
CGCGACATTG CGCTGTTCAG CAAGAAACTG GCGACCATTG TGTGTGATGC GCCGGTAACG
CTCAACCTTG CAGACGCAAC GCTCGCCGAT TACGACCGCA GCCGGGTGAT CGCGGTCTTT
CAGGAACTCG AGTTTGGTGC ATCGCTGGTG AAGCGCCTTC CGCCATCCGG CAGTACGACC
GTTGCGCAGC CGCTGCCGTC CATACCGCAG GCAGACGCAT CGGCGCCGCT TCAGGCGGAC
ATGTTCGCGC CGGAAGCGCC TGCTGTGCGA GAGTCGTCCG CCGAGCCGCA GCAGTTGACC
CTGTTTGGCG ATCCAGAGGT TGTGCCGCCG CCGGTCGCGC ATGTCGAGGC GCCGCTGCGC
GCAGCGCCGG GTGAATACCG CGCTGCCTGC ACCGATGCCG ACCTGGAAGC GGTGGTCGCC
GAACTCGCCA CGGCGCCGCT GGTTGCGTTC GATACCGAAA CGCGCGGGAT GAACCCGCTG
CGCGATGATC TCGTCGGTCT GTCGCTGGCG ACGGTCCCCG GTCGAGCCTG GTACATTCCC
GTCGGGCATA CCACCGGCGA GATGCAATTG CCGCGCGATC GGGTGATCGC GGCGCTACGC
TCATTCTTTG CCGATCCGGC GCGCCCCAAG GTTGCCCACA ACGCCAAGTT TGATATCGAA
GTGCTGGAAC GCGCCGGCAT GCCGGTCGCC GGTTTGTCGT TCGATACGAT GCTGGCGGCT
GGCTTGCTCG ACAAACGTCG TAATTTGAAA GACCTGGCGT TCTACGAACT GGAACTCGCT
GAACCGCTGG ACGCAATTGG CGACCTGATC GGCAAGGGGA AGAACCAGGT CACCTTCGCA
GAGGTGCCGA TTGCCCGCGC TACGCGATAT GCTGCTGCCG ACGCCGATAT GACGCTGCGC
CTGCAACCGG CGCTCGAAGC GAAACTGCGT GCTGCCGGGA GTCTCGCCGA TATGTTCTAC
CGCCTGGAAA TGCCGCTCGT GCCGGTGCTG GTGCGGATGG AGCAGTCCGG CATTCTGCTG
GATGTTCCGT ATATGCGCGC TCTGGGGGAG CGCATGGGTC GGGAACTTGC ACAGATCGAG
CAGCAGATCT TTGCCATCGC CGGGAAACCG TTCAATATCA ACTCCGGCGA TCAGTTGAGC
GAGGTGCTGT TCGGTCCCAA GATCAACCTG CCCACAACGG GTCTCGAACG CACCCGGACA
GGGCGCTACT CGCTGACGGC GCAGGCGCTC GAAGAGTTGC AGGCAAGCGA CACTACCGGC
ATTATTGAGT TAATCCTGCG CCACCGTCGC CTGAGCAAAC TCAAATCAAC CTATGTTGAT
GAACTCCCGG CGCTGGTCAA CCCGGAGACC GGCAGAGTCC ATACCGATTA CAATCAGCTT
GGTGCGGCGA CCGGGCGGTT GAGCAGCAAT TCGCCGAACC TGCAAAATAT TCCAACCCGC
ACCGAAGAGG GGCGCGAGGT GCGGCGCGGC TTCATCGCTG CCCCCGGTCA TGTGTTGATC
GCCGCCGACT ATTCGCAGAT CGAGTTGCGC GTGCTGGCGC ACATCACCGG CGATCCGAAC
CTGATCCAGA CGTTTATTGA AGGGCGCGAC ATTCACGCAG CCACCGCCGC ACGTCTGTTC
GGCGTCGGTT TCAATGCCGT GGACAAGAAT CAGCGGCGGA TCGCCAAAAC CGTCGTTTTT
GGCGTCATCT ACGGCATCAG CCCATTCGGG TTGGCGCAAC GCCTGGGTAT TTCGCGCGAA
CAGGCGCGCA ACCTGATCGA TAGCCTGTTC AATCAGTTCC CCCGCATCCG CGACTACATC
GACCGCACGC TGGAAACCGG ACGGAATGAA GGGTATGTGC AGTCGCTCTT CGGCCGCCGC
CGCCCCATGT TCGACCTGCG CACGTCCGGT CCGCGGCGTC AGGCAGCCGA ACGTGAAGCG
ATCAACCATC CGATCCAATC CACAGCGGCG GACATCATGA AACTGGCGAT GATTGCTGTG
GATGCCGAAC TGCGTCGCCG CCGGTTGCGC ACCCGCATGT TGCTCCAGGT TCACGACGAA
TTGATCTTCG AGGCGCCGGA GGCGGAAGTG GATGAGGTGG TGGCGCTGGT GCGCGAGCGA
ATGGAAGGGG TGTTGAGCGA TATGCACCCG CCGTTCGCCG TCCCGCTGCG CGTCGAGATC
GAAAAAGGTT CGAATTGGGA AGAACTGACG CCAGTGTGA
 
Protein sequence
MAHNRPLLLL IDGHALAYRA FHALAEAGLR SSTGEPTYAV YGFTSAMLNA IEEYQPEYAA 
VAFDVGKTFR DDLYAEYKAN RAETPAEFEQ QLERIKQVLA AFDIPIYTAE GYEADDVIGT
LARQATERGV DVLILTGDTD TLQLVDDHVT VLLNNPYLRG PKNTTRYGVA EVTERYKGLR
PHQLADLRGL KGDPSDNIPG VKGIGEAGAI ALLNQFGTIE NLYDHLDEAP KRYQKHLEGQ
RDIALFSKKL ATIVCDAPVT LNLADATLAD YDRSRVIAVF QELEFGASLV KRLPPSGSTT
VAQPLPSIPQ ADASAPLQAD MFAPEAPAVR ESSAEPQQLT LFGDPEVVPP PVAHVEAPLR
AAPGEYRAAC TDADLEAVVA ELATAPLVAF DTETRGMNPL RDDLVGLSLA TVPGRAWYIP
VGHTTGEMQL PRDRVIAALR SFFADPARPK VAHNAKFDIE VLERAGMPVA GLSFDTMLAA
GLLDKRRNLK DLAFYELELA EPLDAIGDLI GKGKNQVTFA EVPIARATRY AAADADMTLR
LQPALEAKLR AAGSLADMFY RLEMPLVPVL VRMEQSGILL DVPYMRALGE RMGRELAQIE
QQIFAIAGKP FNINSGDQLS EVLFGPKINL PTTGLERTRT GRYSLTAQAL EELQASDTTG
IIELILRHRR LSKLKSTYVD ELPALVNPET GRVHTDYNQL GAATGRLSSN SPNLQNIPTR
TEEGREVRRG FIAAPGHVLI AADYSQIELR VLAHITGDPN LIQTFIEGRD IHAATAARLF
GVGFNAVDKN QRRIAKTVVF GVIYGISPFG LAQRLGISRE QARNLIDSLF NQFPRIRDYI
DRTLETGRNE GYVQSLFGRR RPMFDLRTSG PRRQAAEREA INHPIQSTAA DIMKLAMIAV
DAELRRRRLR TRMLLQVHDE LIFEAPEAEV DEVVALVRER MEGVLSDMHP PFAVPLRVEI
EKGSNWEELT PV