Gene Franean1_2537 details

Gene Information

Locus tag: Franean1_2537
Symbol:
ID: 5675697
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3016135
End bp: 3017805
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 66%
IMG OID: 641241453
Product: recombinase
Protein accession: YP_001506873
Protein GI: 158314365
COG category: [L] Replication, recombination and repair
COG ID: [COG1961] Site-specific recombinases, DNA invertase Pin homologs
TIGRFAM ID:
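
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the last codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record; variable names are illustrative only.
start_bp = 3016135
end_bp = 3017805
gene_length_bp = 1671
protein_length_aa = 556

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the final codon is a stop codon and is not counted in the protein.
expected_protein_aa = codons - 1

print(span_bp, codons, remainder, expected_protein_aa)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.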


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGGT TCGCGTTCGA GGGTCGCTGC TCCACCGAGG ATCAGCAGGA TCCCGAGTCC 
TCACGGGCAT GGCAGATAAC CCGCGCCAAG GCGCTGATCG AGCCGCACGG CGGCGAGATT
GTCACGGAGT ACTTCGACGC CGGAAAGTCC CGGTCCATTC CTTGGCAGCG GCGCCCCATG
GCTAACGCCC TCCTCCAGGC CCTGAAAGAT CCCCAGCGCG GTTTCGAAGC GGTGGTCATC
GGCGAGCCGC AGCGCGCCTT CTACGGCAAC CAGTTCGGCC TGGTATTTCC CCTGTTCAGC
CACTACCAGG TGCCTCTGTG GGTGCCCGAG GTCGGTGGGC CGATCGATCC CGACAACGAG
GCCCACGACC TCATCATGTC GGTGTTCGGC GGGATGTCCA AAGGTGAGCG TAACCGGGTA
AAGATCCGGG TGCGTGCGGC GATGGCAGCC CAGGCCAAGG TAGAAGGCCG TTTCCTGGGC
GGCCGGCCCC CGTACGGGTA CCGGTTGATC GATCTGGGCC CGCATCCGAA TCCGTCCAAG
GCTGCGGACG GTCGGCAGCT CCACGGCCTC GCGCTGGACG AGGTGGCCGC ACCCGTCGTG
GTTCGGATCT TCGCCGAGTT CCTCCGCGGT CACGGCATCT TCGCGATCGC GGAAGGGCTC
ACCCGGGACG GCATCTCCAG TCCTTCCGCC CACGACCCTG CCCGCAACAG CCACCGCAGC
GGGAAGGCAT GGTCGAAAGG GGCTGTCCGC GCGATCCTGA CCAACCCCCG CTACACCGGC
CGGCAGGTCT GGAACCGCCA GCGCAAGGAC GAGGTGCTCC TCGACGTCGA GGACGTCAGC
CTCGGACACA CCACCAAACT GCGATGGAAC CCCGAAAACA CGTGGATCTG GTCAGAGGAG
ACGGTCCACC CGGAGATCAT CGACATCGAG ACCTTCACAC AGGCGCAGGA ACTTCTCGCG
GGCCGTGGAC GAGGAGCGGG TGACCAGAAG ACCCCCCGGA CGCGCCAGCC CTACGCCCTG
CGCGGGGCGG TCCATTGCGG TATCTGTAAT CGCAAGATGC AGGGACACAC CGTCCGCCGC
GCCACCTACT ACCGGTGCCG CTACCCGCAG GAATACGCCC TGGCGAACAC GGTCACCCAT
CCGGCGAACG TCTACGTCCG CGAGGACGTC CTCGTCCCGG CACTCGACGG CTGGCTCGCG
GACACCCTCA CACCGCCCCG GCTGGCCGAG ACCCTGGACG CCATGGTGGC CGCCCAGGCG
AGTCCATCCG TCGACGACCT GGCCGCGCAA CGAGCCCGCC AGACGATCGA GGAGAGCAAC
GCCAAGCTCA CGAAATACCG GGCGGCACTG GACGCAGGAG CGGACCCGGC AGTCGTGACC
GGATGGATCG CTCAGGTACA GGCTGAGAAG ACCGCTGCCG AGCGGGACCT TCGCGAAGCG
CAGGAGAGCG ACGTACGGCA GCTGACACGC GACGAGATCA GTAGCATGGT GGAGTCACTC
GGCGAGATCG CCAGCGCCCT AGCGGAGGCG GAGCCCGTCG AGAAGACGGA TCTGTACCGA
TCGCTCCAAC TACGCCTGAC TTACCATCCC ACAACCAACA CGGTAAGGGC CGACATGAAG
ATCGACACAA GTTACCGTGG GGTAATGGAT CGTGTCCGAG GGGGGACTTG A
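
The GC content listed in the record can be recomputed from a sequence printed in this 10-base block layout. The helper below is a minimal sketch, not the database's own method; `gc_fraction` is a hypothetical name, and the demonstration uses only the first row of the gene sequence, so the full sequence would have to be pasted in to compare against the listed figure.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring spaces and line breaks."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)


# First row of the gene sequence only (60 bases), copied from the record.
first_row = "ATGATCCGGT TCGCGTTCGA GGGTCGCTGC TCCACCGAGG ATCAGCAGGA TCCCGAGTCC"
print(f"{gc_fraction(first_row):.1%}")
```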
 
Protein sequence
MIRFAFEGRC STEDQQDPES SRAWQITRAK ALIEPHGGEI VTEYFDAGKS RSIPWQRRPM 
ANALLQALKD PQRGFEAVVI GEPQRAFYGN QFGLVFPLFS HYQVPLWVPE VGGPIDPDNE
AHDLIMSVFG GMSKGERNRV KIRVRAAMAA QAKVEGRFLG GRPPYGYRLI DLGPHPNPSK
AADGRQLHGL ALDEVAAPVV VRIFAEFLRG HGIFAIAEGL TRDGISSPSA HDPARNSHRS
GKAWSKGAVR AILTNPRYTG RQVWNRQRKD EVLLDVEDVS LGHTTKLRWN PENTWIWSEE
TVHPEIIDIE TFTQAQELLA GRGRGAGDQK TPRTRQPYAL RGAVHCGICN RKMQGHTVRR
ATYYRCRYPQ EYALANTVTH PANVYVREDV LVPALDGWLA DTLTPPRLAE TLDAMVAAQA
SPSVDDLAAQ RARQTIEESN AKLTKYRAAL DAGADPAVVT GWIAQVQAEK TAAERDLREA
QESDVRQLTR DEISSMVESL GEIASALAEA EPVEKTDLYR SLQLRLTYHP TTNTVRADMK
IDTSYRGVMD RVRGGT
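
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds a codon table from the standard NCBI base ordering (TCAG); translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which this sketch does not model. `translate` is a hypothetical helper, and only the first 30 bases and the last row of the gene sequence are used here.

```python
from itertools import product

# Codon table in NCBI order (T, C, A, G); "*" marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last row of the gene sequence, copied from the record.
gene_start = "ATGATCCGGT TCGCGTTCGA GGGTCGCTGC"
gene_end = "ATCGACACAA GTTACCGTGG GGTAATGGAT CGTGTCCGAG GGGGGACTTG A"

print(translate(gene_start))  # MIRFAFEGRC
print(translate(gene_end))    # IDTSYRGVMDRVRGGT
```

Both fragments agree with the corresponding ends of the protein sequence. The last row works in frame 0 only because the preceding 1620 bases are a multiple of three.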