Gene Franean1_1310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1310 
Symbol 
ID5675675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1581372 
End bp1582658 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content66% 
IMG OID641240241 
ProductRNA-directed DNA polymerase 
Protein accessionYP_001505669 
Protein GI158313161 
COG category[L] Replication, recombination and repair 
COG ID[COG3344] Retron-type reverse transcriptase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.389336 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGTTCA CCGCGCTACT CCACCACGTG GACCTGGACC GCCTGGAGGC GGCGTACCGG 
GCGATCCGCC CGCAGGCCGC GCCCGGCGTG GACGGAGTGA CGTGGCGGGA GTATGGGCGG
GACCTTCAGG GTAACCTGCG GGACCTGCAC GCCCGGATCC ATTCGGGGCG TTACCGGGCG
AGTCCCTCGC GGAGGGTGTA CATCCCGAAG GCGGACGGGC GGCAGCGGCC GCTCGGTATC
GCCACGCTAG AGGACAAGAT TGTCCAGCGG GCGGTCGTCG AGGTGCTGAA CGCCATCTAC
GAGGAGGACT TCCTCGGCTT TTCGTACGGG TTTCGGCCGG GGCGAAGCCA GCACATGGCG
CTCGACGCGC TCGCGGTCGG GATCCAGCGG AAGAAGGTGA GCTGGGTGCT CGACCTGGAC
ATCCGGGATT TCTTTTCCAG CCTCAGTCAT CAATGGCTGG TCAAGTTCCT TGAGCACCGG
ATCGCGGACA AACGGATCCT GCGCCTGGTC CAGAAATGGC TGAGCGCGGG AGTCATCGAG
AACGGCGCGT GGTCACAGAC AATGGAAGGG TCACCGCAGG GGGCTTCGGT ATCGCCGCTG
CTCGCTAACG TGTACCTGCA CCACGTCTTT GACCTGTGGG TGCGGTGGTG GCGGAATCGC
CAGGCGCGTG GTGATGTGAT CACCGTGCGT TTTGCTGACG ACGCTGTCGC CGGCTTCGAG
TACGAGGATG ACGCGCGGCG GTTCCTTGTC GATCTTCGGG ACAGGTTCGC GAAGTTCGGC
CTGGGGTTGC ATCCCGACAA GACCCGGCTG ATCGAGTTCG GGCGGTTCGC CGCCCGGAAC
CGGTCGCGGC ATGGGCAGGG CAAACCCGAG ACGTTCAGCT TCCTGGGCTT CACGCACATC
TGCGCGACGG GCAAGCGGGG CTACTTCTGG GTGCGGCGGG TCACGGACAA GAGGCGGATG
GCGGCGAAGC CACGCGAGAT CAAGGTCGAA GCGAAGCGGC GCAGCCACCT ACCCATCCCC
GTCCAGGGGC AATGGTTGCG CAGCGTGGTC AACGGCCACC TGAACTGCTA TGCCGTGCCC
GGCAACATGA ACGCGACGGC TTCATTCCGC TACGAGGTGC TCCATGCCTG GCACAAGGCG
CTATCGCGCC GTAGTCAGCG CGGGCACCTG AACTGGGGAC GGATGGGGCC CATCGCGAAC
AGGTGGCTAC CGACCGCAAA GGTCCGACAT CCCCTGCCTA CCGTTCGGCT CGACGCCAAT
ACCCGAGGCA GGAGCCCAGT GCGGTAG
 
Protein sequence
MRFTALLHHV DLDRLEAAYR AIRPQAAPGV DGVTWREYGR DLQGNLRDLH ARIHSGRYRA 
SPSRRVYIPK ADGRQRPLGI ATLEDKIVQR AVVEVLNAIY EEDFLGFSYG FRPGRSQHMA
LDALAVGIQR KKVSWVLDLD IRDFFSSLSH QWLVKFLEHR IADKRILRLV QKWLSAGVIE
NGAWSQTMEG SPQGASVSPL LANVYLHHVF DLWVRWWRNR QARGDVITVR FADDAVAGFE
YEDDARRFLV DLRDRFAKFG LGLHPDKTRL IEFGRFAARN RSRHGQGKPE TFSFLGFTHI
CATGKRGYFW VRRVTDKRRM AAKPREIKVE AKRRSHLPIP VQGQWLRSVV NGHLNCYAVP
GNMNATASFR YEVLHAWHKA LSRRSQRGHL NWGRMGPIAN RWLPTAKVRH PLPTVRLDAN
TRGRSPVR