Gene Franean1_0485 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0485 
Symbol 
ID5668905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp569226 
End bp570305 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content67% 
IMG OID641239415 
Productintegrase catalytic region 
Protein accessionYP_001504853 
Protein GI158312345 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.444951 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGTCC ACTGTGTCCG TACGCCTGCT GTATCTGATC TTCGTGCGGG TCTGCGGCTG 
GCTGGTCCTC CTCGGTCGTT CGTCGGCGTC CAAGGACATC GAGCTGCTGG TGTTGCGGCA
CGAGGTCACC ATGCTGCGCC GTACCCAGCC CAAGCCCCGG TGGGACTGGG CGGACCGGGC
GGTACTCGCC GCACTGATCC AGCTTTTGCC GAAGACACTG CGAGCGCACC GACTGGCCAC
CCCCGGCACC GTCCTACGGT GGCACCGCCG TCTAATCACA CGGAAATGGA CCTACCCGCA
CCGGACAGGA CGACCGCCGG TCAGCACGGA GATCGCGACC CTCATCGAGC GGCTCGCGAC
CGAGAACACG ACGTGGGGAT ACCAGCGGAC CCAGGGCGAG CTCCTCACAC TCGGCCACCG
CATTGGCGCG TCCACGATCG CCGGGTCCTG ACGTCCCTGG GGCTGCCCCC GGCACCGAAA
CACCAGACCG ACACGACGTG GCGGCAGTTC CTGCGCACCC AGGCATCGAC GATGCTGGCG
GTCGACTTCT TCCACGTGGA CTGCGCCGTG ACGCTGCGGC GTCTGTACTG CTTCTTCGTC
CTGGAAGCCG GCTCCCGCTC CGTCCACATC CTCGGGGTCA CCGCCCACCC GGACGGGCTG
TGGACCACCC AACAGATCCG CAACCTCCTC ATGGACCTCG GCGCCCGGAC AGCCGACTTC
CAGTTCCTGA TCCGCGACCG CGCCGGGCAG TTCACCGCGT CCTTCGACGC GGTCCTCGCC
GACGCCGGCA TCACCACCGT CAAGATCCCA CCCCGGACGC CCCGGGCGAA CGCCTACGCC
GAACGGTTCG TCCACACAGT CCGGACCGAG GTCACCGACC ACATGCTGAT CGTCGGTGAA
CGGCACCTAC GTTCTGTCCT GGCCGAGTAC GCCGCCCACT ACAACGGACG ACGACCCCAC
CGCAGCCGCG ACCTTCAACC ACCACGACCC GACCACCCCA TCGCCGACCT GACCAAGGAA
CGGATCAAGC GCCGCCCCGT CCTCGACGGC CTGATCAACG AATACGAACG AGCCACCTAA
 
Protein sequence
MMVHCVRTPA VSDLRAGLRL AGPPRSFVGV QGHRAAGVAA RGHHAAPYPA QAPVGLGGPG 
GTRRTDPAFA EDTASAPTGH PRHRPTVAPP SNHTEMDLPA PDRTTAGQHG DRDPHRAARD
REHDVGIPAD PGRAPHTRPP HWRVHDRRVL TSLGLPPAPK HQTDTTWRQF LRTQASTMLA
VDFFHVDCAV TLRRLYCFFV LEAGSRSVHI LGVTAHPDGL WTTQQIRNLL MDLGARTADF
QFLIRDRAGQ FTASFDAVLA DAGITTVKIP PRTPRANAYA ERFVHTVRTE VTDHMLIVGE
RHLRSVLAEY AAHYNGRRPH RSRDLQPPRP DHPIADLTKE RIKRRPVLDG LINEYERAT