Gene Franean1_0478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0478 
Symbol 
ID5668898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp563583 
End bp564650 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content68% 
IMG OID641239408 
Productintegrase catalytic region 
Protein accessionYP_001504846 
Protein GI158312338 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGTAC GCCTGCTGTA CCTGATCTTC GTTCGGGTCT GTGGCTGGCT GGTTCTTCTC 
GGCCGTTCGT CGGCGTCGAA GGACATCGAG TTGCTGGTGC TGCGGCATGA GGTCGCGGTG
CTGCGCCGTA CCCAGCCCAA GCCCCGGTGG GACTGGGCGG ACCGGGCGGT CCTCGCCGCA
CTCATCCGGC TCCTGCCCAG GGCGCTGCGA GCGCACCGGC TGGTCACGCC CGGCACCGTC
CTACGGTGGC ACCGCCGCCT GATCACACGG AAATGGACCC ACCCGCAGCG GACCGGACGG
CCACCGATCG GCACGGAGAT CGCCACGCTG ATCGAGCGGC TCGCGACCGA GAACACGACA
TGGGGCTACC AGCGAATCCA GGGCGAGCTC CTCACACTCG GCCACCGGGT GAGCGCCTCC
ACGATCCGCC GGGTCCTGAA GACCCTCGGG CTGCCCCCGG CACCGAAACG GCAGACCGAC
ACGACGTGGC GACAGTTCCT GCGTACACAG GCATCGACCA TGCTGGCCGT CGACTTCTTC
CACGTGGACT GCGCCGTGAC ACTCCGGCGT CTGCACTGCT TCTTCGTCAT AGAGGTCGAC
TCCCGCACCG TCCACATCCT CGGAGTCACC GCCCACCCCG ACGGACCATG GACCACCCAA
CAAGCCCGGA ACCTCCTCAT GGACCTCGGT GATCAGGCGG CCGACTTCCA GTTCCTGATC
CGCGACCGCG CCGGCCAGTT CACCGCGTCG TTCGACACGG TCCTCGCCGA CGCCGGCATC
ACCGCCGTCA CGATCCCACC CCGGGCTCCC CGGGCGAACG CCTACGCGGA ACGGTTCGTC
CGCACCGTCC GGACCGAGGT CACCGACCGC ATGCTGATCG TCGGCGAGCG GCATCTGCGC
ATGGTCCTGG CCGAGTACGC ACGGCACTAC AACGGACGAC GACCCCACCG CGGCCGCGAC
CTTCAACCAC CCCGGCCCGA CCACCCCGTC GCAGACCTGA CCCAGGAACG GATCAAGCGC
CAGCCCGTCC TCGGTGGCTT GATCAACGAA TACGAACGAG CCGCCTAA
 
Protein sequence
MSVRLLYLIF VRVCGWLVLL GRSSASKDIE LLVLRHEVAV LRRTQPKPRW DWADRAVLAA 
LIRLLPRALR AHRLVTPGTV LRWHRRLITR KWTHPQRTGR PPIGTEIATL IERLATENTT
WGYQRIQGEL LTLGHRVSAS TIRRVLKTLG LPPAPKRQTD TTWRQFLRTQ ASTMLAVDFF
HVDCAVTLRR LHCFFVIEVD SRTVHILGVT AHPDGPWTTQ QARNLLMDLG DQAADFQFLI
RDRAGQFTAS FDTVLADAGI TAVTIPPRAP RANAYAERFV RTVRTEVTDR MLIVGERHLR
MVLAEYARHY NGRRPHRGRD LQPPRPDHPV ADLTQERIKR QPVLGGLINE YERAA