Gene Franean1_3035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3035 
Symbol 
ID5671415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3568885 
End bp3569952 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content68% 
IMG OID641241934 
Productintegrase catalytic region 
Protein accessionYP_001507354 
Protein GI158314846 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGTAC GCCTGCTCTA TCTGATCTTC GTGCGGGTAT GTGGCTGGCT GGTTCTCCTC 
GGCCGCTCGT CGGCATCGAA GGACATCGAG CTGCTCGTGC TGCGGCACGA GGTCGCGGTG
CTGCGCCGCA CCCAGCCCAA GCCCCGGTGG GACTGGGCAG ACCGGGCGGT CCTCGCCACA
CTGATCCGAC TCCTACCCAG GGCCCTGCGA GCGCACCGGC TGGTCACCCC CGGCACCGTC
CTCGGGTGGC ACCGCCGTCT CATCACACGG AAATGGACCC ACCCGCAGCG GACCGGACGG
CCACCGATCA GCCCGGAGAT CGCCACGCTG ATCAAGCGGC TCGCGACCGA GAACACGACG
TGGGGCTACC AGCGAATCCA GGGCGAGCTC CTCAAGCTCG GCCACCGGGT CGGTGCGTCC
ACGATCCGCC GGGTCCTGAA GTCCCTGGGT CTCCCGCCGG CGCCCAGGCG GCAGACCGAC
ACGACCTGGC GGCAGTTCCT ACGCGCCCAA GCCTCGACCA TGCTGGCAGT CGACTTCTTC
CATGTGGACT GCGCCGTGAC GCTGCGGCGT CTGTACTGCT TCTTCGTCCT GGAGGTCGGC
TCCCGCACCG TGCACATCCT CGGGGTCACC GCCCACCCGG ACGGGTCGTG GACCACCCAG
CAGATTCGGA ACTTCCTGAT GGACCTCGGC GACCGGGCAG GCGACTTCCA GTTCCTGGTC
CGCGACCGGG CCGGACAATT CACCGCCTCC TTCGACGCGG TCCTCGCCGA CGCCGGCATC
ACAGCCGTCA AGATCCCACC CCGAACCCCA CGGGCGAACG CCTACGCTGA GCGGTTCGTC
CGCACCGTCC GGACCGAGGT CACCGACCGG ATGCTGATCT TCGGCGAACG GCACCTGCGT
ACCATCCTGG CCGAGTACGC GGCCCACTAC AACGGACGGC GACCCCACCG CAGCCGCGAC
CTTCAACCAC CCCGACCCGA CCACCCCATC GCAAACCTGA CCAAGGAACG GATCAAACGT
CGGCCTGTCC TCGGCGGCTT GATCAACGAA TACGAACGAG CTGCCTAA
 
Protein sequence
MSVRLLYLIF VRVCGWLVLL GRSSASKDIE LLVLRHEVAV LRRTQPKPRW DWADRAVLAT 
LIRLLPRALR AHRLVTPGTV LGWHRRLITR KWTHPQRTGR PPISPEIATL IKRLATENTT
WGYQRIQGEL LKLGHRVGAS TIRRVLKSLG LPPAPRRQTD TTWRQFLRAQ ASTMLAVDFF
HVDCAVTLRR LYCFFVLEVG SRTVHILGVT AHPDGSWTTQ QIRNFLMDLG DRAGDFQFLV
RDRAGQFTAS FDAVLADAGI TAVKIPPRTP RANAYAERFV RTVRTEVTDR MLIFGERHLR
TILAEYAAHY NGRRPHRSRD LQPPRPDHPI ANLTKERIKR RPVLGGLINE YERAA