Gene Plim_2043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_2043 
Symbol 
ID9138746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp2652679 
End bp2654019 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content52% 
IMG OID 
ProductRNA-directed DNA polymerase (Reverse transcriptase) 
Protein accessionYP_003630070 
Protein GI296122292 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGACGA TGCATTGCCA GGTCAAGGAT GTCTTCATGG GTTTGTGGAG TTGGTTCTTG 
AGTTGGTGGC GACCGACAGG TTCCAACCAC ACGTTGCCGG GCAGCCCTTC TTCGAATCAA
GCGGGTGGCC AGAAGCGGCA ATGGCTCGGT TCCACAACAA CTCCGAAACT TCCACGACTC
AGATATCGAT CACCCGAAAT GGTGCGACAA TGGCGCATCG AGGGGCGTGG AACTCAAGTC
GATCAGCTTC CGTACGTTTA TGCCAGATAT GCCCGTACGC GCCGACGTGA GCAGTTTCTG
AATCTGCTAA CGGATACGCG CGTCGATTTG CTGCAGCAAC ATCGGTTGCC AGTCCTCACC
ACTCTTGATG AATTAAGCCA GTTTCTGGGG CTGCCGATCA AGCGGCTGGC CGGATTGATT
CACCTCGCCG ATCATGGCCG ACCTCCCACC GAAAAAAAAT CGCATTACTG GCTGAAATGG
ATCCGCAAAA AGCGCTCGGG GCACCGGCTC ATTGAAGCAC CCAAACCAGC ACTCAAAAGA
GTCCAGGAGA TCATTCTCAG GCGAATTCTT GATCTGGTTC CCGCACATCC AGCGGCACAT
GGTTTTGTCC CCGGTTGTGA CATCGTCTCA AATGCCCAGT TGCATGCCGG GAAAGCTGTG
GTGGTGAAGT TTGATCTGAA AGATTTCTAT CCGTCGGTCA CATTTTCAAA AGTCGTCGCG
ATCTTTCGTT CCCTGGGGTA CTCTCGCGAA ATGGCTTTGT GGCTGGCGCG ATTGACAACC
TCGGCATTGC CCATGGGTAT GGCATTTCCC GATAAGAATC CCTATTCGAT TATGCTGTTC
ACACGGCGTC ATTTACCCCA AGGCGCGCCC ACTTCTCCGG CTCTGGCGAA TCTGGCGACA
TATTCACTGG ATGTACGTCT GACGGGTTTG GCCCGTCGAT TTGGTGCGAT TTACACCCGT
TACGCGGACG ATCTCACATT CTCGGGTGAT CATCGGTTTT TGAAACAGCT CAAAAGATTT
GTGCCGCTGA CTGATAAGGT GATAGTGCAG TGCCGTTTTC ATTCTCAGCC CGCGAAGCGA
AGGATTCTCC GGCAGTCGAA TCGGCAGACA GTCACTGGTG TGGTCGTGAA TCAGCGTCCC
AATTGCTCTC GAAGAGACTA CGATCAACTC AAGGCGATTT TGCACAATTG CGTTCGGCAT
GGTCATCAGA CACAGAATCG AGATCAACAG GTTGATTTTC GAGCTCATCT TCAGGGCAGG
TTGGCACATA TCCGGCATTT GAATCCTGAC CGGGGAGTGC GTTTGCAGAA GATTTTTGAG
AAGATCCACT GGAATTCATG A
 
Protein sequence
MRTMHCQVKD VFMGLWSWFL SWWRPTGSNH TLPGSPSSNQ AGGQKRQWLG STTTPKLPRL 
RYRSPEMVRQ WRIEGRGTQV DQLPYVYARY ARTRRREQFL NLLTDTRVDL LQQHRLPVLT
TLDELSQFLG LPIKRLAGLI HLADHGRPPT EKKSHYWLKW IRKKRSGHRL IEAPKPALKR
VQEIILRRIL DLVPAHPAAH GFVPGCDIVS NAQLHAGKAV VVKFDLKDFY PSVTFSKVVA
IFRSLGYSRE MALWLARLTT SALPMGMAFP DKNPYSIMLF TRRHLPQGAP TSPALANLAT
YSLDVRLTGL ARRFGAIYTR YADDLTFSGD HRFLKQLKRF VPLTDKVIVQ CRFHSQPAKR
RILRQSNRQT VTGVVVNQRP NCSRRDYDQL KAILHNCVRH GHQTQNRDQQ VDFRAHLQGR
LAHIRHLNPD RGVRLQKIFE KIHWNS