Gene PCC8801_2964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2964 
Symbol 
ID7104405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3062082 
End bp3063905 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content41% 
IMG OID643475996 
ProductRNA-directed DNA polymerase (Reverse transcriptase) 
Protein accessionYP_002373111 
Protein GI218247740 
COG category[L] Replication, recombination and repair 
COG ID[COG3344] Retron-type reverse transcriptase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGATGTA AACCGATAAC TGAGTCGCCC GAAGATATCA ACGCTCAACT AAATTGGATA 
ATGATTGATT GGGATGACTT AGAAAAGCGT GTATATAAGC TACAAAAGCG CATTTACAAA
GCGTCTCGTC GTGATGATGT CAAGACAGTT CGCAGACTCC AAAAAACCCT AACAAAATCC
TGGGCGGCAA AATGCCTAGC GGTGCGTCGT GTTACCCAAG ATAATCAAGG TAAAAAGACG
GCTGGTGTGG ATGGTGTGAA ATCACTGACC CCAAAGCAAC GTCTAAACCT CATAGATAAA
CTAAAATTGG GTACGAAGGT CAAACCCACT CGGAGAGTAT GGATTCCGAA ACCTGGGACT
GAGGAGAAAA GACCTTTAGG AATACCGACC ATGTATGACC GCGCATTGCA AGGGCTTGTC
AAACTGGCAT TAGAACCAGA ATGGGAAGCT AAATTTGAAC CTAACAGTTA TGGGTTCAGA
CCAGGACGCT CATGTCAAGA TGCCATCGGA GCAATATTCC TAGCAATAAA CAAAAAAGCC
AAATATGTGC TTGATGCTGA TATTGCCAAA TGTTTCGACC GCATTGACCA TGAACAACTC
CTAAATAAAT TAAATACCTA TCCGACCCTA CGGAAACAAA TCCGAGCTTG GCTAAAAGCT
GGAGTCATGG ATGGAAAAGA GTTGTTCCCA ACATCTGAGG GTACGCCACA AGGAGGGGTT
ATATCACCTC TACTAGCAAA CATAGCCCTC CATGGGATGG AAAACGAAAT CAATAAACTA
GCTGAAACAT TCGATATGAG AGGTCCCGAC GGTAAACTAC TAGGCAAGAG GGACAAAAGA
AAATCAGTTA GTCTTATTCG TTACGCCGAT GACTTCGTAA TCCTCCACGA AGACATAACC
ATTGTCCAAA GATGTAAAGA GTTTATCTCT GAATGGTTAA AAGACATGGG ATTGGAATTG
AAACCAAGTA AAACCCGATT AGCCCATACA TTAGAGGAGT ACAACAAAGA AAAACCTGGC
TTTGATTTCC TCGGATTTAA CGTCCGTCAA CACAAAGTAG GAAAGTTTAA CTCTGGAAGG
GTAAAAGGAA AGCTATTAGG TTTTAAGACC ATTATAACTC CAAGCAAGGA AAGCCAGAAA
AGACACTACA AAAAGATTGC AGAAACAATA GAAAAGCACA AAGGGAAAGC CCAGGCAATT
CTAATAAGAA ACCTTAACCC AATAATCAGA GGATGGTGCA ATTATTTCTC AACCGTAGTA
AGTCAAAAAG TCTTCGAGAG ACTATGGCAT TTAACTGTCT GGAAACTAAT CAAATGGGGT
TTGAAACGCC ATCGAAACAA AGGAAGGAAA TTTATAGTTT CCAAATATTT CCAAAATATA
GGTGGTAATA ATTGGGCATT CGCAACCAGG CAAGAAGGTA AAAACCCGAT GCGGTTACTA
CAACATAGCG ATACAACAAT CACCCGCTAT GTAAAAGTTA AAGACGATGC CAGTCCATAC
AACGGCGACC TAATTTATTG GAGTTCAAGA ATGGGCAAAC ACCCTGAGAT GTCAACGCGA
ACGGCATTAC TGCTTAAAAA GCAAAAAGGG AAATGCGCTC ACTGCGGATT GTTCTTTAAA
GAGGGAGATG TAATTGAACT TGACCACATC ATTCCTAAGT CAAAAGGCGG AAAGAATGAA
TATAAAAACT GGCAACTTCT CCATCGACAT TGCCATGATG AAAAGACCAG AAATGATGGA
AGTTTAGATA GGAAACTATC ACATAAATCC ATCAAATTCC CTAAGAATTA TCGATGGGAA
AACGATATTC TGGTGACGTG CTAA
 
Protein sequence
MGCKPITESP EDINAQLNWI MIDWDDLEKR VYKLQKRIYK ASRRDDVKTV RRLQKTLTKS 
WAAKCLAVRR VTQDNQGKKT AGVDGVKSLT PKQRLNLIDK LKLGTKVKPT RRVWIPKPGT
EEKRPLGIPT MYDRALQGLV KLALEPEWEA KFEPNSYGFR PGRSCQDAIG AIFLAINKKA
KYVLDADIAK CFDRIDHEQL LNKLNTYPTL RKQIRAWLKA GVMDGKELFP TSEGTPQGGV
ISPLLANIAL HGMENEINKL AETFDMRGPD GKLLGKRDKR KSVSLIRYAD DFVILHEDIT
IVQRCKEFIS EWLKDMGLEL KPSKTRLAHT LEEYNKEKPG FDFLGFNVRQ HKVGKFNSGR
VKGKLLGFKT IITPSKESQK RHYKKIAETI EKHKGKAQAI LIRNLNPIIR GWCNYFSTVV
SQKVFERLWH LTVWKLIKWG LKRHRNKGRK FIVSKYFQNI GGNNWAFATR QEGKNPMRLL
QHSDTTITRY VKVKDDASPY NGDLIYWSSR MGKHPEMSTR TALLLKKQKG KCAHCGLFFK
EGDVIELDHI IPKSKGGKNE YKNWQLLHRH CHDEKTRNDG SLDRKLSHKS IKFPKNYRWE
NDILVTC