Gene Franean1_1394 details

Gene Information

Locus tag: Franean1_1394
Symbol:
ID: 5669801
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1690153
End bp: 1691343
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 67%
IMG OID: 641240319
Product: integrase family protein
Protein accession: YP_001505746
Protein GI: 158313238
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
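
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic is given here, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
# Coordinates copied from the "Start bp" and "End bp" fields of this record.
START_BP = 1690153
END_BP = 1691343


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1191 396
```

Both values agree with the Gene Length and Protein Length fields.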


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0589061
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCGCA CCGGCAGACC ACCCCTCGGC CTCGGAACGT ACGGTGAGAT CCGGGTCTAC 
AAGATGGACT CCGGGCGTTA CAAGGCCCGC ACGCTCTACC GCGACTTCGA CGGCGTGACC
CGCCCGGTCG CCCGGAACGG CGCGAGCAAG AACGCCGCGG AGACGGCCCT CAAGAACCAC
CTCCGTGACC GCGTCCGGGA GGCCGGAGCC GAGGCAGAGA TCACCGCGGG GTCGACCGTC
GAGGTGCTCG CCGAGGCGTG GTGGGCCGAG TTCTCCAAGC AGGACAAGTC CCCCGGCACC
TTCCGCCTCT ACCGCGACCG GCTGGACAAC CAGATCATCC CGGCACTCGG AAAGATCCGG
ATCCGGGAGC TCACGACTGG GGCCGCCAAC CGGCACATCA GTACGGTCAG CCAGAACAAC
GGCGCTGGCG TGGCCAAGGC AACCCGCACG GTCCTCAGCA ACATGTGTGC CTTCGCCTGC
CAGCGCGACG TGATGAAGAC CAACCCCATC CGCGAGGTGG CCCCGGTCCG GCCGAAGGCC
AAGAAGGTGC CGAAGGCTCT CAGCGTCGCC GAGCTCCAGC AACTCCGCGC ATTGTTCACT
TACGACCCCG CCGCGGTGCG CCGAGACATC CCCATGCTGT CGAGCATCCT GCTGGCGACC
GGCGTGCGGA TCGGAGAGTG TCTGGCGTTC GTCGAGGACG CCCTCGACCC CAAGGAGGGC
TCGATCGAGG TGCGCGGGAC GGTGATCTGG CTCAAGGGGG TCGGACCCAT CGTCAAGCCC
GCACCGAAGA GCGCCGCGGG CTTCCGGCGG CTCCTGCTAC CAAAGTGGGC CGTCAACCTG
CTTCGGTCCA GGTTCGAGGA GTCAGCCGTG ATCAGCAAGC CAGTGCCGGT GCTGAACGGC
GAGGCATGGG ACTCCCCGCT GGCGTTCCCC ACATCCACAG GGCGGCTGCG GGACATCACC
AATGTCGAGA GTTACTGGCG AGAAGCCGTC ACCACCGCAG GATTCGACTG GGTGGTGCCC
CACACTTTCC GCAAGACCGT CGCCACCGAG ATGGACCGCG CGGGTCGGAC AGCACGCGAA
ATCGCGGACC AGCTCGGCCA TTCTCAGATC ACACTGGTAC ACAATACTTA CCTAGGCCGC
AAAGCCCGTG ACACCGGCGC CGCCGCAGCT CTTGAAGGGC TGGTCGCATG A
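
The gene sequence is printed in blocks of ten bases, so it has to be joined before any statistic such as GC content is computed. The sketch below shows one way to do that; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGCCCCGCA CCGGCAGACC ACCCCTCGGC CTCGGAACGT ACGGTGAGAT CCGGGTCTAC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 1191 bp sequence to `gc_percent` gives a value that can be compared with the GC content field of the record.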
 
Protein sequence
MPRTGRPPLG LGTYGEIRVY KMDSGRYKAR TLYRDFDGVT RPVARNGASK NAAETALKNH 
LRDRVREAGA EAEITAGSTV EVLAEAWWAE FSKQDKSPGT FRLYRDRLDN QIIPALGKIR
IRELTTGAAN RHISTVSQNN GAGVAKATRT VLSNMCAFAC QRDVMKTNPI REVAPVRPKA
KKVPKALSVA ELQQLRALFT YDPAAVRRDI PMLSSILLAT GVRIGECLAF VEDALDPKEG
SIEVRGTVIW LKGVGPIVKP APKSAAGFRR LLLPKWAVNL LRSRFEESAV ISKPVPVLNG
EAWDSPLAFP TSTGRLRDIT NVESYWREAV TTAGFDWVVP HTFRKTVATE MDRAGRTARE
IADQLGHSQI TLVHNTYLGR KARDTGAAAA LEGLVA
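
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns codons to amino acids in the same way as the standard code (the two differ in which start codons are allowed), and it builds the codon table from the usual TCAG ordering. The `translate` function is illustrative only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence and the last four codons of the gene.
first_row = "ATGCCCCGCA CCGGCAGACC ACCCCTCGGC CTCGGAACGT ACGGTGAGAT CCGGGTCTAC"
print(translate(first_row))       # MPRTGRPPLGLGTYGEIRVY
print(translate("CTGGTCGCATGA"))  # LVA
```

The two outputs match the first twenty and the last three residues of the protein sequence listed above.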