Gene Franean1_2542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2542 
Symbol 
ID5670936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3021334 
End bp3022998 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content69% 
IMG OID641241458 
Productintegrase domain-containing protein 
Protein accessionYP_001506878 
Protein GI158314370 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACTCC ACGACAGACC GCGCGCCAAC AGTGCCAAGA CCGCGACCAG TTGTCTGTCG 
TGCCTGGCCT GGGGGTTGCC GTGTCGGCGA GGGAAGTGCC GAGCCTGCTG CGACTTCGTC
GCACCCCGCT ACGGCCATGC GGTCGGCGAG TGCGGTGCCT GCCGTCGGCG GGAGCCGCTG
AAGAAGGGGT TCTGCCGGTT GTGCTGGTGT CAGGCACGCC TGGAACGCAC GACGGGCACC
TACAAAATGC TGATGCCCCA CGTCCGTCTG GTCCGCCATC ATCAGCTGTT CTTCGCCGAC
ATGGCCGGCT ACCGAGACAC CCACACGGCC CCGCGACGTT TCGACGAGAC CGGTCGAAAA
CCCCCGCCGC CACCCGCTTA CCGGCCCGAG ACCCGCCACG TCCAGCCGGC TCTGTCCGAC
GACGTCGTTC GCCGGCACTA CCGGTACGGA CGTTACGACC TGCGCCGAGG ACCGGCGCCT
GACAACCCGT GGTTGGCCTG GGCGCTTTAC ATCGCCTATG ACCTCGCTGA GAAAAGGGGA
TGGGGATCGT TCGTCCGAGG TGGAATGCAA CGGACCCTGG TCATGCTGCT GGCCGGCCAC
ATCGACGGAG AACTGATCCG CGTCAGCGAC TTCTACGAGA CCGTGGCCGA GCACTCCACG
AACATCGACG ACACCATCGA GATCCTCACC GTCATGGACG TCGTCCTCGA TGACCGGGAA
CAGGCTTTCG ACCGGTGGCT GCGAGCTGAA CTCGACGGGC TCGCCCCAGC AATCCGACGG
GACACCCATA CCTGGACCAC GTTGCTGCAC AATGGCGGCC CGCGGAACCG AGCCCGTGCG
CCGGGCACCG CCGCTGACTA CCTGCGCACC ATCCGGCCCG CGCTGACGGC CTGGTCGACC
CGCTACGGCC ACCTGCGGGA GGTCACCCGT GACGACGTCA TCGCCTACCT CGAACCGCTG
ACCGGCCCGC CGCGGGAGAG GGCCACGACC GCCCTGCGGT CGCTGTTCGT ATGGGCGAAA
CGCGCCAATG TCATCTTCCG TAACCCCGCC GTCCGCCTGC GCGTCCCACA GCGAGCAGAC
CCGGTCTGGC AACCGCTGCG CCCGGACGAA CTCGCCCGCA CCGTCGCGAC GGCCACCACC
GTGCACGCCC GCCTGTTCGT CGCCCTCGCC GCCGTCCACG CCGCCCGCGT CGGACAGATC
CGCGCCCTGC AACTCGACGA CGTCGACCTC GGCAACCGAC GCATCACCAT CGCCGGCCAT
GACCGGCCAC TGGACGACCT CACCCACCAG ACCCTGCTGG CGTGGCTGGA GCACCGCCGG
ACCCGCTGGC CAGTCACCGC CAACCGCCAC CTGGTCATCA GCCCATGCAC CGCCGGAGGG
CTCGGACCCG TCAGCTACCC CTGCCTCGCC CGCCCGCTGC GCGGGCTGCC CGCGACCCTC
GACCGGCTCC GTGTTGACCG GCAGCTCGAG GAGGCACTTA CCTCCGGCGC CGACCCCCTG
CACGTCGCCG CGGTCTTCGG CGTCAGCGAC GCCACCGCGA TCCGCTACGC GGACAATGCC
CGTCAGCTCC TGGCGCGCCC TCACGAGAGC GGTCCCTCGG GATCGCCGCG AACCCAAGGG
TCCAGACCCG GCAAAGAGCC TGATCGACAC TTCAGTTCCC GCTGA
 
Protein sequence
MTLHDRPRAN SAKTATSCLS CLAWGLPCRR GKCRACCDFV APRYGHAVGE CGACRRREPL 
KKGFCRLCWC QARLERTTGT YKMLMPHVRL VRHHQLFFAD MAGYRDTHTA PRRFDETGRK
PPPPPAYRPE TRHVQPALSD DVVRRHYRYG RYDLRRGPAP DNPWLAWALY IAYDLAEKRG
WGSFVRGGMQ RTLVMLLAGH IDGELIRVSD FYETVAEHST NIDDTIEILT VMDVVLDDRE
QAFDRWLRAE LDGLAPAIRR DTHTWTTLLH NGGPRNRARA PGTAADYLRT IRPALTAWST
RYGHLREVTR DDVIAYLEPL TGPPRERATT ALRSLFVWAK RANVIFRNPA VRLRVPQRAD
PVWQPLRPDE LARTVATATT VHARLFVALA AVHAARVGQI RALQLDDVDL GNRRITIAGH
DRPLDDLTHQ TLLAWLEHRR TRWPVTANRH LVISPCTAGG LGPVSYPCLA RPLRGLPATL
DRLRVDRQLE EALTSGADPL HVAAVFGVSD ATAIRYADNA RQLLARPHES GPSGSPRTQG
SRPGKEPDRH FSSR