Gene Franean1_5518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5518 
Symbol 
ID5675765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6686578 
End bp6688011 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content70% 
IMG OID641244374 
Productintegrase family protein 
Protein accessionYP_001509778 
Protein GI158317270 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0834932 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCACGA ACAGCGACAA CCACCCGACC ATGAGCCTGC AACAGGCACT CCGAGAGCAC 
CTCGACCGTG CCCGCGCCGG ACAGGGCCTG CTCTACCCAC AGCCCGAAGG AGAGACCAAG
TTCGCCTGCG ACATGGGCGA CAGCCGCTAC GGCTGCGCCT GGTGCGACAA CATCGAATGG
CCCGACGGCA ACCCCCCTGG ACGGCGAGTG ATGGCTGCGG GGGACGGTGC CGGGAACGAC
CGGCGGAAGC GGCGGCGCTC CCAGGGCGAA GGCGCCCTGT TCCAGCGGGC GAGCGACGGC
CTGTGGGTCG GCCGCGCCGA TCTCGGCTGG GTGGACGGGA AGCGTTCCCG CAAGACGGTC
TACGGCAAGA CCGAGAAGGA ATGCCGGGAG AAGCTCACCA AGGTCCAGCG CGCGGCCGAA
CTCGGCGTCA ACGTCACGGC TGAGCGCCGG ACGGTCGCGG TCTGGCTGGG GGAGTGGCTC
GACATCAAGG AGGGCGACGG CACCCGCGCG TCGACGCTCC GGGCGTACCG CTGGTTGATC
AACATGCACA TCGTGCCGGT GATCGGCCGT GTCCAGCTCG ACAAGCTGAC TCCGTTGGAC
GTCCGGCGGC TCGTCGCCTC GGCGAAGAAG TCGGGGCTGT CGGCGGGCAG CGTCCGCCAC
GTGCACAGCT TGATCCGCAA CGCTCTGGCG GAAGCCGAGC GGCTGGACCT GGTGGCGCGC
AACGTGGCGA AGGCGGTCAA GGCGCCACCC ACCCCGCATC GGGAGGTTCG GGCGCTGCGG
CCGGAAGAGG CGCGCCGGCT CGTCGAGGTG CTCCGTGGTG AGCGGCTCGA AGCGGTGTTC
GCGTGTGGGC TGATGCTCGG GCTACGCCGC GGGGAGATCC TCGGCCTGCG CTGGTCTGAT
GTCGACCTGG ACGGCGCGAC GCTCCACGTT CGCCAGACCC TGCAACGGGT CGACGGCTCG
CTGATGTTCG TCCCGGCGAA GACGGAGCGG TCCCACCGGC GGCTGCCCAT CCCGCCGAAG
CTGGTGACGA TCCTGCGGCG GCACCGGGCC ACCCAGACAG CGGAACGAAC CGGCCTCGGT
GACGCCTGGA CGGAAACCGG GCTGGTGTTC ACCTCGTCCA TCGGCACGCC GTTGGAACCA
AGGAACGTCA ACCGGCGGTT CGATGTGCTG CGCCGTCAGG CCGGGCTGCC GTGGCTACGG
CTGCATGATC TGCGCCACGC CTTCGCGTCG ATGCTCTTCG CTGAGGGTGT GCCGGCCCGG
ACGGTGATGG AGCTGCTCGG GCACTCCACG ATCCAGCTCA CCATGAACAC CTACACGCAC
GTGATGCCGG AGACCCAGCG CGACGCGGTC GGCCGGCTCG ACCGGATCTT CAACGACGAT
GCCGGCGATG TCGCCGACGT CGACGGAGCG GACGGGGAGG GCCTCGCGGG CTGA
 
Protein sequence
MPTNSDNHPT MSLQQALREH LDRARAGQGL LYPQPEGETK FACDMGDSRY GCAWCDNIEW 
PDGNPPGRRV MAAGDGAGND RRKRRRSQGE GALFQRASDG LWVGRADLGW VDGKRSRKTV
YGKTEKECRE KLTKVQRAAE LGVNVTAERR TVAVWLGEWL DIKEGDGTRA STLRAYRWLI
NMHIVPVIGR VQLDKLTPLD VRRLVASAKK SGLSAGSVRH VHSLIRNALA EAERLDLVAR
NVAKAVKAPP TPHREVRALR PEEARRLVEV LRGERLEAVF ACGLMLGLRR GEILGLRWSD
VDLDGATLHV RQTLQRVDGS LMFVPAKTER SHRRLPIPPK LVTILRRHRA TQTAERTGLG
DAWTETGLVF TSSIGTPLEP RNVNRRFDVL RRQAGLPWLR LHDLRHAFAS MLFAEGVPAR
TVMELLGHST IQLTMNTYTH VMPETQRDAV GRLDRIFNDD AGDVADVDGA DGEGLAG