Gene Haur_1921 details


Gene Information

Locus tag: Haur_1921
Symbol:
ID: 5733810
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2317173
End bp: 2318960
Gene Length: 1788 bp
Protein Length: 595 aa
Translation table: 11
GC content: 50%
IMG OID: 641279065
Product: RNA-directed DNA polymerase
Protein accession: YP_001544692
Protein GI: 159898445
COG category: [L] Replication, recombination and repair
COG ID: [COG3344] Retron-type reverse transcriptase
TIGRFAM ID:
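
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2317173, 2318960)
print(length)                      # 1788
print(protein_length_aa(length))   # 595
```

Both values agree with the Gene Length and Protein Length fields listed in the record.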


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCACAG CCGACACTAT TTTGGCAGTC ATTCGTGATC GTGGGCAACG AGGATTGCCC 
TTAGCGCGTG TCTATCGCAT ATTGTTCAAT GAGGATCTCT ATTTGCGTGC CTATGGGCGG
CTAGCAACCA AACAAGGCGC ACTCACCAAA GGAAGTACAG ACGAAACTAT TGATGGAATG
TCAATGGCGA AAATTCATCG TATCATCGCC GATCTTCGCC GCGAGACCTA TCGCTGGACT
CCAGTACGAC GGGTCTACAT CCCAAAAGCA ACGGGAAAAA CCCGACCTTT AGGGGTTCCT
ACATGGTCAG ATAAGTTGGT TCAGGAAGTC TTACGCTCCA TTCTCGACGC GTATTACGAC
CCCCAAATGA GCGACCATTC ACATGGTTTT CGTCCGAACC GTGGATGCCA TACGGCTCTT
AAAGCCATTC AGCGTTGCTG GACGGGCACA CGGTGGTTTA TTGAGGGTGA TATTGCACAA
TATTTTGACA CGATCAATCA CACGACACTC CTCACAATAT TGGCGAAACG CATCCACGAT
GGCCGTTTTC TTCGGTTGAT CCAGACACTC TTGCAGGCAG GATATCTTCA CGATTGGGTG
TATCATCCGA CGCTGAGTGG AACACCACAA GGCGGGGTAA TCTCCCCACT CTTGGCAAAC
ATTTACCTGC ATGAATTTGA CCAGTTTGTC GAACATACGC TGATACCTGC CTATACCAAA
GGGCAGAGAC GGAAAGTCAA TCCGGCCTAT GCACAGATGG AACAACGAAT CAGCAAATTA
CGTCGTCAAC GGGAGTACGC AAGCGTTACC CCGCTCCTGA AGGAGCTTCG CACCCTGCCC
TCCCGTGATG TGCATGATCC TGATTACCGA CGGTTACGCT ATGTGCGGTA TGCCGATGAC
TTTCTCCTTG GGTTTGCGGG AACAAAAGTG GAAGCAGAGG CGATCAAACA GCAGATTAAT
GTATGGCTGT ATGATCATCT CCAATTAAAA CTGTCCACTC AGAAAACGCT GATCACGCAC
GCAAGTAGTG ATCCAGCCCA TTTTCTTGGC TATGACATCG TGACGCAACA GGCAAATAGC
AAACAGACGG GAAACCGACG CATTGTGAAT GGCCGGATCG CGTTACGCGT AGCCCGCGCA
ACGATTACTG CGAAATGCAA CCGCTATATG AAAAACGGGA AGGCTACCCA CCGACCCGAA
CTGCTGAGCG AAACCGACTT TACTATTATT GCAACGTATC AGCAGGAATA TCGTGGCATT
GTGCAATATT ACATGCTTGC CCACAATGTA TCGCATTTGC ACCGCCTGCA TTGGGTTATG
AAGCAATCCT TGCTCAAAAC ACTGGCTGCC AAACACAAAA CAACTAGTGC CGTGATGCGA
AGGAAATATC TGGCAACACA GCAGTTGCCT GACGGGCGCA GAATGCTCTG CATTCGCGTA
TTCGTCGAGC AACCAGCACG ACCGCCACTC ATAGCTCAAT TTGGTGGGAT TTCTTTGCGC
CGCAATCCAA TGGCAATCCT CAATGAACGT CCACCCCAAC TATGGAATGT GGGAACCGAA
ATTATCCAAC GCTTGAAAGC GCAAGAATGC GAGCTATGTG GAAGCCATGA AGATGTGGAA
GTCCACCATA TCCGCAGGCT TGCCAACCTA AAGCGATCGG GACGAACGGA AAAACCACAA
TGGATGCAAC GCATGATTAC GCGGCAACGA AAAACGCTCG TAGTCTGTTC TCAATGTCAC
CACCGCATCC ATGCTGGAAA AACACTACCA CAGCAAATCA CGAAATAA
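
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be cleaned before its length or GC content can be compared with the fields in the record. The following is a minimal sketch of that cleanup; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first displayed row is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed row of the gene sequence, copied from the record.
first_row = "ATGCGCACAG CCGACACTAT TTTGGCAGTC ATTCGTGATC GTGGGCAACG AGGATTGCCC"
seq = clean_sequence(first_row)
print(len(seq))                  # 60
print(f"{gc_fraction(seq):.0%}")  # GC content of this row only, not of the whole gene
```

Passing the full sequence text through the same two functions gives values to compare against the Gene Length and GC content fields.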
 
Protein sequence
MRTADTILAV IRDRGQRGLP LARVYRILFN EDLYLRAYGR LATKQGALTK GSTDETIDGM 
SMAKIHRIIA DLRRETYRWT PVRRVYIPKA TGKTRPLGVP TWSDKLVQEV LRSILDAYYD
PQMSDHSHGF RPNRGCHTAL KAIQRCWTGT RWFIEGDIAQ YFDTINHTTL LTILAKRIHD
GRFLRLIQTL LQAGYLHDWV YHPTLSGTPQ GGVISPLLAN IYLHEFDQFV EHTLIPAYTK
GQRRKVNPAY AQMEQRISKL RRQREYASVT PLLKELRTLP SRDVHDPDYR RLRYVRYADD
FLLGFAGTKV EAEAIKQQIN VWLYDHLQLK LSTQKTLITH ASSDPAHFLG YDIVTQQANS
KQTGNRRIVN GRIALRVARA TITAKCNRYM KNGKATHRPE LLSETDFTII ATYQQEYRGI
VQYYMLAHNV SHLHRLHWVM KQSLLKTLAA KHKTTSAVMR RKYLATQQLP DGRRMLCIRV
FVEQPARPPL IAQFGGISLR RNPMAILNER PPQLWNVGTE IIQRLKAQEC ELCGSHEDVE
VHHIRRLANL KRSGRTEKPQ WMQRMITRQR KTLVVCSQCH HRIHAGKTLP QQITK
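
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons, so the standard assignments are used here. The `translate` helper is an illustrative name, and it assumes the input is already in frame and on the coding strand.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last displayed rows of the gene sequence, copied from the record.
first_row = "ATGCGCACAG CCGACACTAT TTTGGCAGTC ATTCGTGATC GTGGGCAACG AGGATTGCCC"
last_row = "CACCGCATCC ATGCTGGAAA AACACTACCA CAGCAAATCA CGAAATAA"
print(translate(first_row))  # MRTADTILAVIRDRGQRGLP
print(translate(last_row))   # HRIHAGKTLPQQITK
```

The two outputs match the beginning and the end of the protein sequence shown above, and the last row ends in the stop codon TAA.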