Gene Franean1_4607 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4607 
Symbol 
ID5672952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5489481 
End bp5491262 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content63% 
IMG OID641243468 
ProductRNA-directed DNA polymerase 
Protein accessionYP_001508884 
Protein GI158316376 
COG category[L] Replication, recombination and repair 
COG ID[COG3344] Retron-type reverse transcriptase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAGCG CCGCAACGGT GCTGGGTGTC CTGCGTGAAC GCGGCAAACG AGGGCTGCCG 
TGCGACGAAC TGTACCGGCA ACTGTTCAAC CCACAGATGT ATCTGCTGGC CTACGGGCGT
ATCTACACCA ACCATGGTGC GATGACACCC GGAGTCACAC AGGAAACCGT GGACGGCATG
TCCCAGGAGA GGATCGGTCG CATCATCGAT GCGATGCGCC ACGAACGCTA CCGATTCCAC
CCCGTCCGGC GAGTCCACAT CCCGAAAAAG AATGGGAAGA CCCGCCCGTT GGGGCTACCG
ACCTGGTCGG ACAAGCTCGT CGGGGAAGTT GTACGCCTGC TCCTGGAGGC CTACTACGAA
CCGACGTTCT CCGACCGGTC CCACGGGTTC CGGCCACGGC GAGGCTGCCA CACCGCTCTG
CGGGAGATAG CCAACACCTG GACCGGAACA GCCTGGTTCA TCGAAGGAGA CATCACCGAC
TGTTTCGGGT CTCTCAATCA CGACCTGATG ATCGGAATCC TGTCGGAGAG GGTCCGCGAT
AACCGGTTCC TGAGGCTGGT GCGTAACATG CTACGAGCCG GATACCTGGA GGACTGGAGG
TGGGGGGCGA CTCTGTCGGG GGCTCCCCAA GGGGGCGTGG CCTCGCCGAT CCTGTCCTCG
ATCTACCTGC ACAAGCTGGA CGAGTTCGTC GAGAAGATTC TGATTCCGGA GTACACCCGA
GGGGGCCGCA GGGCGCGTAA CCCTGCTTAC CTCGTTCTGC AGAACGAGCT GGCCAAGGCA
CGCCGACGTG GTGACCGGGG CCAAGCCCGG ATACTACGAC GGCGAATGGT CAGCCTGCCC
AGCTCCGATC CCGATGATCC GGGATATCGG CGGCTGCGTT ACTGCAGATA TGCGGATGAC
CATCTACTCG GGTTCACCGG ACCGAAGGCC GAAGCCGAGG AAATCAGACA GCGGCTGGCG
GAGTTCCTGC GTGACGACCT CAAGTTGGAA CTGTCCGCCG ACAAGACGCT GATAACCCAC
GCCCGTACCG GCGCGGCCCG TTTCCTCGGC TACGAGATCA CTGTCCAGCA CAACAGCAGC
AAGACCACCC GGCGCCGTCG ATCGGTCAAC GGGCAGGTCG CGCTACGGGT GCCACGCGAT
GTGATCAAAG CTAAAAGTGT CCCCTACCTG CACCGCGGAA AACCCGCTAA GCAGAAGGCC
CTGACCAACG GTAACGACTA CACCATCGTC GCCACCTACG GGGCCATCTA CCGGGGCATC
GTCCAGTACT ACCTGCTGGC CGGAGACGTC CACCGACTTC ACCGGCTGCG CTGGGTCATG
GAGACATCCA TGCTCAAGAC CCTGGCAGGC AAGCACCGCT CGTCGGTGTC GAAGATGGCA
GCCAAACACA AGGCCAAAAT CCAGACACCG CACGGGCCAC GCACCTGCTT CGAGGCACGC
ATCGAACGCG ACAGCAGAAA ACCACTGGTC GCACGGTTCG GTGACATCCC ACTCCGCCGG
CAGAAAACAG CGACGGTCTC TGACCGTCAG CCGACCCGGG TGGACTATCC GCACAAGGAA
CTCCTCACCA GGCTCCTCGC GGATATCTGC GAAGTCTGCC AGCGCACGGG CAACGTTGAA
GTTCACCACG TCCGCGCGCT CAAAGACCTC GCGGCACCCG GCCCTCTGCC GCCCCCGTGG
GTCACAGCCA TGGCAAACCG CAGACGCAAG ACACTCGTGG TCTGCGCTAC CTGCCACGGC
CAGATCCACA AGGGGCGGCC CGCCACACCG CTCACGCAGT AG
 
Protein sequence
MQSAATVLGV LRERGKRGLP CDELYRQLFN PQMYLLAYGR IYTNHGAMTP GVTQETVDGM 
SQERIGRIID AMRHERYRFH PVRRVHIPKK NGKTRPLGLP TWSDKLVGEV VRLLLEAYYE
PTFSDRSHGF RPRRGCHTAL REIANTWTGT AWFIEGDITD CFGSLNHDLM IGILSERVRD
NRFLRLVRNM LRAGYLEDWR WGATLSGAPQ GGVASPILSS IYLHKLDEFV EKILIPEYTR
GGRRARNPAY LVLQNELAKA RRRGDRGQAR ILRRRMVSLP SSDPDDPGYR RLRYCRYADD
HLLGFTGPKA EAEEIRQRLA EFLRDDLKLE LSADKTLITH ARTGAARFLG YEITVQHNSS
KTTRRRRSVN GQVALRVPRD VIKAKSVPYL HRGKPAKQKA LTNGNDYTIV ATYGAIYRGI
VQYYLLAGDV HRLHRLRWVM ETSMLKTLAG KHRSSVSKMA AKHKAKIQTP HGPRTCFEAR
IERDSRKPLV ARFGDIPLRR QKTATVSDRQ PTRVDYPHKE LLTRLLADIC EVCQRTGNVE
VHHVRALKDL AAPGPLPPPW VTAMANRRRK TLVVCATCHG QIHKGRPATP LTQ