Gene Franean1_3066 details


Gene Information

Locus tag: Franean1_3066
Symbol:
ID: 5671445
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3620896
End bp: 3622359
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 66%
IMG OID: 641241964
Product: RNA-directed DNA polymerase (Reverse transcriptase)
Protein accession: YP_001507384
Protein GI: 158314876
COG category: [L] Replication, recombination and repair
COG ID: [COG3344] Retron-type reverse transcriptase
TIGRFAM ID:
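
The coordinate and length fields above are mutually consistent, which can be checked with a minimal Python sketch. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(3620896, 3622359)
print(length_bp)                  # 1464
print(protein_length(length_bp))  # 487
```

Both values match the Gene Length and Protein Length fields of this record.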


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGAGC GTGGGAAGTC GGATCGTCTT GTAGTACCTG CGAACCCGCC GAACAAGGCC 
ACAGCTGCGG AGGTGGGGGA GGGAAGGGGA CGAGCCAAGG GGAACACGGA CAGTAAAACG
CATCCCGGAC ACAGCGCCGG AACTGATGCG TCCAGTGCGC TGGGCCGTGT GCGTGAAGTG
GCACGACGGG ACAGGAACGC GCGGTTCACC GCGCTGCTGC ACCATGTCAC GCTGGGTCGG
CTCCGGGAGG CGTATCGGGC GATCAGCCCG AAAGCGGCTG CTGGGGTGGA CGGGGTGACG
TGGACCGACT ACGGGCAGGA CCTGGAGGCC AATCTGCAGG ATCTGCACGT GCGGGTGCAG
TCGGGATGTT ACCGGGCGAC ACCGTCGAGG CGGGCGTACA TACCGAAGGC GGACGGGCGG
CTTCGGCCGC TCGGGATCGC CTCGCTGGAG GACAAGATTG TTCAGCGGGC GGTTGTCGAG
GTGCTGGGCG CCGTCTACGA GGTGGACTTC CGGGGCTTCT CGTATGGGTT CCGGCCGGGG
CGGGGTCCGC ATGACGCGTT GGACGCCCTC GCGGTCGGGA TCTGGAGGAA GCGGGTGAAC
TGGGTGCTCG ACGCGGACAT CCGCGACTTT TTCGGCCAGA TTGATCATTC CTGGCTGCGG
AGGTTTCTGG AGCACCGGAT CGCGGACAAG CGGGTCCTGC GGCTGATCGA CAAGTGGTTG
GCCGCGGGGG TCGTCGAGGA TGGGGAGTGG ACAGCGTGTG AGGAAGGTTC GCCACAAGGG
GCGTCAGTGT CCCCGCTGCT GGCGAACGTC TACTTGCACT ATGTCCTCGA CCTGTGGGTC
GACTGGTGGC GGCGTCGCCA CGCGCGCGGA GATGTCATTG TCGTGCGCTG GGCCGACGAC
TTCATCGTCG GGTTCGAATA CGAGGAGGAT GCGCGGCGGT TCCTGGACGA GCTGCGCGAA
CGGTTCGCGA AGTTCGGGTT GGAACTGCAC CCGGATAAGA CGCGGCTGAT CGAGTTCGGG
CGGTACGCCG CCCGGGATCG GAAGCGGCGG GGTCTGGGCA AGCCGGAGAC GTTCGACTTT
CTGGGGTTCA CGCACATCTG TGCGACATCC CGGAGGGGGA CGTTCTGGCT CAAGCGCATC
ACGATCGCGA AACGCATGCG GGCGAAGCTG AAGGCGGTCA ATGAGCAGCT GAAGCGTCGC
CGGCATACGC CCATCCCGGA TCAGGGACGC TGGTTGGCGA GCGTGCTACG TGGGCATATG
GCCTACTACG CCGTGCCCGG CAACACCGAC ACGATGTCGG CCTTCCGTAC CCAGGTGACA
CGGCACTGGT GCAAGGCGCT GCGGCGCCGC AGCCAACGTG ACCGGATGAA CTGGCAACGG
ATGGGGCGGA TCGCGGCTCG ATGGCTACCC CCAGTCCGAG TGATGCATCC CTTCCCGGAG
AGACGCTTCG CAGCCAGAAC CTGA
 
Protein sequence
MHERGKSDRL VVPANPPNKA TAAEVGEGRG RAKGNTDSKT HPGHSAGTDA SSALGRVREV 
ARRDRNARFT ALLHHVTLGR LREAYRAISP KAAAGVDGVT WTDYGQDLEA NLQDLHVRVQ
SGCYRATPSR RAYIPKADGR LRPLGIASLE DKIVQRAVVE VLGAVYEVDF RGFSYGFRPG
RGPHDALDAL AVGIWRKRVN WVLDADIRDF FGQIDHSWLR RFLEHRIADK RVLRLIDKWL
AAGVVEDGEW TACEEGSPQG ASVSPLLANV YLHYVLDLWV DWWRRRHARG DVIVVRWADD
FIVGFEYEED ARRFLDELRE RFAKFGLELH PDKTRLIEFG RYAARDRKRR GLGKPETFDF
LGFTHICATS RRGTFWLKRI TIAKRMRAKL KAVNEQLKRR RHTPIPDQGR WLASVLRGHM
AYYAVPGNTD TMSAFRTQVT RHWCKALRRR SQRDRMNWQR MGRIAARWLP PVRVMHPFPE
RRFAART
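
The relationship between the gene and protein sequences above can be illustrated with a short Python sketch. It is a sketch under stated assumptions, not the pipeline used to produce this record: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in the start codons it permits).

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11
# (bacterial) uses the same amino-acid assignments.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCACGAGC GTGGGAAGTC GGATCGTCTT"))  # MHERGKSDRL
# Last blocks of the gene sequence, ending in the TGA stop codon.
print(translate("AGACGCTTCG CAGCCAGAAC CTGA"))        # RRFAART
```

The two printed fragments correspond to the first ten and last seven residues of the protein sequence shown above. Passing the full gene sequence to `gc_fraction` would give the value that the GC content field reports as a rounded percentage.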