Gene Franean1_3413 details

Gene Information

Locus tag: Franean1_3413
Symbol:
ID: 5671784
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4042777
End bp: 4044030
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 61%
IMG OID: 641242301
Product: RNA-directed DNA polymerase (Reverse transcriptase)
Protein accession: YP_001507721
Protein GI: 158315213
COG category: [L] Replication, recombination and repair
COG ID: [COG3344] Retron-type reverse transcriptase
TIGRFAM ID:
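
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates and a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(4042777, 4044030)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch gives 1254 bp and 417 aa, which agrees with the Gene Length and Protein Length fields.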


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.344584
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTGGAC CAGCATCGCG GGGAAAGTCG TTTGAGATCC CCAAACAGCT GGTGTGGGAT 
GCCTGGCTGA AAGTGAAGGA AAACGGTGGG GCACCGGGGC CCGACGGAGT GACGGTCGAG
CAGTTCGAGG CGAACGTGAA GGATCGCCTG TACGTGCTGT GGAACCGCAT GTCGTCGGGG
TCGTACTTCC CCGGACCCGT CGGAGCGGTG GAGATCCCGA AGAAAGGTGT GAAAGGAGGA
GCAAGAACCC TCGGCATTCC CAATGTAGTA GATCGCGTAG CGCAGACGGT GCTAAAGCTG
GCTCTGGAGC CGAAGGTCGA GCCGGTGTTC CACCGGGACT CGTACGGCTA CAGACCAGGC
CGTTCGCAGC GCCAGGCGCT CGAGGTCTGC CGGAAGCGGT GCTGGTCGCA CGACTGGGTC
GTTGACTTGG ACGTGCGGAA GTTCTTCGAC ACCGTGCCGT GGGAAAAGCT GCTGAAGGCG
GTGGCGTACC ACACGGACCA GAAATGGGTC CTGATGTACG TGGAACGCTG TCTGAAAGCG
CCGACGAAGC ATGCCGACGG AACCCTGCAA GAGAGAACCA TGGGCACAGT CCAGGGTGGC
CCATTCTCCC CGCTGGCGGC TAACATCTAT CTGCACTGGG GCCTTGACGC CTGGATGGCG
CGCGAGTTCC CGACCGTTCC ATTCGAGCGG TGGGCGGACG ATGTTGTGTT TCACTGTGTG
AGCCTGGAAC AGGCCAGGGA AGTGCGGGAC GCGGTGGTGG CAAGGCTTGT CGAAGTCGGG
TTGGAAGCTC ACCCCGACAA GACCCGGATC GTGTACTGCA AGGACAGCAA CCGAGGTGGC
GACTATGAAA ACACGTCGTT CACGTTCCTG TCGTATACCT TCAGGCCACG AGTGGCATGG
AACGGCACCC AGAAGAAACG CTTCACCAGC TTCATCCCGG GTGCCGCGCC GGATCGGGTG
GCCTCGTTCA GCCGCGAAAT GCGCGACCTG AGGCTGCACA GGCGAACGAA CCTGACACTG
GATCAACTCG CCGCGGACAT CAACCCGAAA GTGGCGGGTT GGCTAGAATA TTTCACCATG
TTCTACCCGA GCGTGGTGCT ACCCATCGGC ACGCGCATTG ACAGCCATCT CGTGCGCTGG
GCGAGGAAGA AGTACAAACG GCTGACACGA AGTGAGCGTA GGGCGTGGGC ATGGCTCAAG
GGAGTCCGGG AACGGTCCCC TGACCTGTTT GCGCACTGGG CGTTGCGGTA CTGA
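
The sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The record lists a GC content of 61%, but it does not say how that figure was computed or rounded. The sketch below shows one common way to compute a GC percentage from a block-formatted sequence. The function name `gc_percent` is hypothetical.

```python
def gc_percent(block_formatted_dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(block_formatted_dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First block of the gene sequence: 6 of 10 bases are G or C.
print(gc_percent("GTGAGTGGAC"))
```

Pasting the full gene sequence into `gc_percent` as a multi-line string works the same way, because `str.split()` discards both the spaces and the newlines.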
 
Protein sequence
MSGPASRGKS FEIPKQLVWD AWLKVKENGG APGPDGVTVE QFEANVKDRL YVLWNRMSSG 
SYFPGPVGAV EIPKKGVKGG ARTLGIPNVV DRVAQTVLKL ALEPKVEPVF HRDSYGYRPG
RSQRQALEVC RKRCWSHDWV VDLDVRKFFD TVPWEKLLKA VAYHTDQKWV LMYVERCLKA
PTKHADGTLQ ERTMGTVQGG PFSPLAANIY LHWGLDAWMA REFPTVPFER WADDVVFHCV
SLEQAREVRD AVVARLVEVG LEAHPDKTRI VYCKDSNRGG DYENTSFTFL SYTFRPRVAW
NGTQKKRFTS FIPGAAPDRV ASFSREMRDL RLHRRTNLTL DQLAADINPK VAGWLEYFTM
FYPSVVLPIG TRIDSHLVRW ARKKYKRLTR SERRAWAWLK GVRERSPDLF AHWALRY
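
The gene starts with GTG while the protein starts with M, which follows from translation table 11 (the bacterial, archaeal and plant plastid code). That table uses the standard amino-acid assignments but accepts several alternative initiation codons, which are read as methionine at the first position. The following is a minimal sketch of such a translation, not the pipeline that produced this record. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS with table 11, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # any valid initiator is read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate_cds("GTGAGTGGAC CAGCATCGCG GGGAAAGTCG"))
```

For the first 30 nucleotides of the gene this yields MSGPASRGKS, which matches the first block of the protein sequence.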