Gene Franean1_2540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2540 
Symbol 
ID5670934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3019940 
End bp3021007 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content66% 
IMG OID641241456 
Productintegrase family protein 
Protein accessionYP_001506876 
Protein GI158314368 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTTCGCG ACCTGCGGGA CCAGCGGCCA TCGGCGGCGT CTGCCGACGA GTTGGCGATG 
TTCGAGACGG ACGTGCTCGC GGGGTTCGTG CTCGCCCGGT CGTCGGCCGG GCTGACGGAC
GGGACGATCC GTAGCGACGT CGGGAACCTG GAACAGATCC GCTCCTGGTT CGGCCGGGCG
CTGTGGGAGA TGGAGCCAGC CGACGCCGAC GTCTATTTCG GTCGGGTGCT GCGGGGTTCG
CCGAGTGGGA CACGGCTGGC CAGGGCGGCG GCACTGAGCA CGTACTTCGA GTTCGTGGAG
TTGCGCCACA AGGTCGAGAT TCACAGGATG ACCGGACGCG TCGTTCAGTG TCCGCTGGAT
GAGATGAACC GGCCGCGTGG AAGCAAGGAC GCGCGACTGC GGATTCCGCC GGTCGATCGC
GAGATTGCCG AGTTGTTCTC CGGCTGGTCG CGCGAATTGG CGACCTGCCG GAAGTTCGCT
CCGAGCGCTC GCAACTACAC CGCGGCGCGG CTGATGGCGG AGGTGGGACT GCGGGTGAAC
GAGGCCCGCT CGCTGGATCT CGCGGACATC CGATGGGAGC TGGGCCGCTT CGGCAAGCTC
CATGTCCGTC ACGGCAAGGG CGCGCACGGT TCGGGGCCGC GAGAACGGAT GGTGCCGCTG
ATCAACCATG CGGGGCAGAC GCTGCGCTGG TACGTCGAGG ACGTGTGGGG TCACTTCGAC
GAGGACCACA CCCGTCACGG CGCTCCGCTT TTCCCTTCCG AGCGGCGCAA TGTCGACGGT
GCACCCGCAC GTGTCGGCTA TGACGCGTTG CGTTCGGGGT TGGCCGCCGC CGCGGCCGAG
CACCTCCCCG CGTGGAAAAG CAGGCTCACT CCCCACATTC TGCGTCATTA CTGCGCTTCC
CAGATGTACC TCAACGGGAT CGATCTTGTT TCGATCCAAG AGATGCTCGG GCATTCCTGG
GTGGCTACGA CAATGCGTTA TGTTCATGTG CACCGCACCC GCATCGAGGA CGCCTGGATC
GCGGGGCAGG GGCGAGCCGC ACAGCGGTTG GAAGGGCTGG TCCAGTGA
 
Protein sequence
MVRDLRDQRP SAASADELAM FETDVLAGFV LARSSAGLTD GTIRSDVGNL EQIRSWFGRA 
LWEMEPADAD VYFGRVLRGS PSGTRLARAA ALSTYFEFVE LRHKVEIHRM TGRVVQCPLD
EMNRPRGSKD ARLRIPPVDR EIAELFSGWS RELATCRKFA PSARNYTAAR LMAEVGLRVN
EARSLDLADI RWELGRFGKL HVRHGKGAHG SGPRERMVPL INHAGQTLRW YVEDVWGHFD
EDHTRHGAPL FPSERRNVDG APARVGYDAL RSGLAAAAAE HLPAWKSRLT PHILRHYCAS
QMYLNGIDLV SIQEMLGHSW VATTMRYVHV HRTRIEDAWI AGQGRAAQRL EGLVQ