Gene Franean1_4157 details


Gene Information

Locus tag: Franean1_4157
Symbol:
ID: 5672512
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4937187
End bp: 4938287
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 75%
IMG OID: 641243030
Product: integrase domain-containing protein
Protein accession: YP_001508447
Protein GI: 158315939
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
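
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based, inclusive positions on the replicon, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4937187
end_bp = 4938287

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1101, matching "Gene Length"
print(protein_length_aa)  # 366, matching "Protein Length"
```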


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.118927
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.593362
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCAGGCG GGCAGCATGA CGGCGGTGAG ACCGTCCGGC GACAGGTCGG TCCCGGGCCG 
GCGCCAGACG TCGTCGACGC GACCGACATG GTGGACGCGG TGGAGATCGT CGCGGCCGGG
CCGGTGGCCG TGCCGGGCAC GGTGGGGCTC GACGGGCCGC TCGCCGGCCG GGTCGGGGAC
TACGCGCGGG CGTCCCGGTC GGCGGCGACC TGGCGTGCCT ACGACGCCGA CCTGCGTCAT
TTCCGGTCCT GGTGCGAGGG ACGTCCCGTA CCGCTGGTCG CCGTGCCCGC CTCCGCGGTG
ACGGTCGCCG GGTACATCAC CGAACTGGCG GACGCCGGGT ATGCGCCGTC GACGATCCGC
CGTCGGCTGG CTGCGATCTC GGTGGCGCAT CAGCTCGCCC ATGCCGAGAA TCCGACGGGG
TCGGCGGAGG TGTCAGCGGT GTGGAACGGG ATCCGCCGGT CGCGGGGGGT GCGCCCGGCG
CGCAAGGCCG CGTTGGACAC GACGTTGCTG TCGCGGGTCG TCGCCGGCCT CGACGACTCG
CAGTTGGCGG ATGTGCGGGA CAGGGCGCTG CTGCTGGTCG GGTTCGCCGG CTGTCTGCGT
CGCAGTGAGC TGGTCGGGTT GGACACCGCC GACCTGGTGG AGACCGACGA CGGGCTGGTC
GTGACGGTGC GCCGTTCCAA GACCGACCAG GAGTCCGCCG GTGCGCAGGT CGGGTTGGCG
TACGGGTCGT ACCGGCCGAC GTGCCCGGTG CGGGCGTGGC GGGGATGGGT GGCGGCCGCG
GCGGCGGCGG GGACGCCGCT GGCTGGCGGG GCGGCGTTTC GGGGGGTGAA CCGGCACGGG
CAGGTCGGCG CGGGCCGGCT CTACCCGGGG TCGGTGGCGC GGATCGTGCA GCGTCGGGTG
GCCGCGGCCG GGTTGGATCC GGCGGATTTC GCGGGGCATT CGCTGCGGTC GGGGTTCGCG
ACGGCGGCGG CGCGGGCCGG GGTGACGGAC CGGTCGATCA TGCGGCAGGG CCGGTGGCGG
TCGGCGGCGT CGTTGGAGTC GTATGTGCGG GCCGGGCGGC TGTTCGACGC GGACAACCCG
TCGGGTCGGG TCGGTCTGTG A
 
Protein sequence
MAGGQHDGGE TVRRQVGPGP APDVVDATDM VDAVEIVAAG PVAVPGTVGL DGPLAGRVGD 
YARASRSAAT WRAYDADLRH FRSWCEGRPV PLVAVPASAV TVAGYITELA DAGYAPSTIR
RRLAAISVAH QLAHAENPTG SAEVSAVWNG IRRSRGVRPA RKAALDTTLL SRVVAGLDDS
QLADVRDRAL LLVGFAGCLR RSELVGLDTA DLVETDDGLV VTVRRSKTDQ ESAGAQVGLA
YGSYRPTCPV RAWRGWVAAA AAAGTPLAGG AAFRGVNRHG QVGAGRLYPG SVARIVQRRV
AAAGLDPADF AGHSLRSGFA TAAARAGVTD RSIMRQGRWR SAASLESYVR AGRLFDADNP
SGRVGL
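
The protein sequence follows from the gene sequence under translation table 11. The sketch below is one minimal way to reproduce that relationship, not the database's own pipeline. The helper names (`clean`, `translate_cds`, `gc_fraction`) are hypothetical. It assumes the sequence is given 5' to 3' on the coding strand, and that an alternative start codon such as GTG is read as methionine at the first position, as is conventional for table 11.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Tables 1 and 11 share these assignments; they differ in allowed start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for NCBI translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS_TABLE_11:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # the initiator codon is read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGCAGGCG GGCAGCATGA CGGCGGTGAG"))  # MAGGQHDGGE
```

Passing the full gene sequence to `translate_cds` should give the protein sequence listed above without its spacing, and `gc_fraction` on the same input can be compared with the GC content field.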