Gene Franean1_3186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3186 
Symbol 
ID5671562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3753177 
End bp3754928 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content63% 
IMG OID641242080 
Productintegrase catalytic region 
Protein accessionYP_001507500 
Protein GI158314992 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.605604 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCCCC CAGAGCACGA CGATTTGTGG CCGACACCAC AGGCCGACAG CCTGCCCTCC 
GGACAGCGGC CAATCCGTCC GGACCCAACC GACCACGACG CCGATTCGGC GGGCGCGACT
GACCGCCCAG TAGCCGGTCC ACATTCCCAT GCCTCAGGCA CCCCAAAACC TCGCCGTCAC
CGCCGGCGAC GAACCATCGC GATCATTCTT CTCGTTCTGC TCGCACTTTT CCCGGTCTTC
GACCGGATCA CGGTCTCAAT GGTCGAGAAG CAGGTCGTCA AACAACTCGA AACCGCCGTC
GCGGACACCC TGGACTGCAA TGCACCCCAA CCAGCCGTCA GGCGTGTCAA CATCGCCGGT
TTCCCGTTCC TCACCCAGGT GCTGCTCGGG AAGCCCAGGG ATGTCAGTCT GTCCATCGAC
GACCTGTCGA CACCCGGCCC CCGGATCTCC TCGCTGGACG CGAACGCCAA AGGCATCAAA
ATTCCGATCT ACAACATGAT CACCGGCGGT GATGGAAAAC TTTCTGTCGA CGAGGTACGG
GCCACGGTCA AGATGAGTTA CACAGACTTG AACGCCTATC TCGCCGGGAA GACGGGCCAC
CTGCAGGTGA AACCGGCAGA CGGCGGACGG AGCCTCAACA TCTCTGGAAC AGTAGACCTA
CCGCTGATCG GCTCCCAGCA GATTGACGGC GTCACCACCT TCCAGGTCCG CGACAATGAG
ATCGAGTTGA CACCATCCCA CCTCACGTTA CGCGGAGCGA TCAACCTCGA CTTTCTGGTC
CCGCTAGGCC AACTAATTCC CTCGATCCCG ATCCCGGTTG GGGAACTACC GTTCGAGGTA
AAGGTGGAGT CAGTGTCCAC CGGCTTGCTG AGCCGGGTGT CCCCGGGGCG GCCACGGGAG
GTCCCGGGGC GGGTACGGGC GCGGATCCTG GCGTTGACCA GGACCACTCC TCCACCGGAG
ACCGGACTGA GCCACTGGAC GAGCACCGAG ATGGCGCGGT ACCTGAAGCG CCGCGAAGGA
GTGTCGGTCT CGCACACCTT CGTGGCCCAG CTGTGGCGGG AGAACGATCT CCAGCCGCAC
CGGCACCGAG TCTTCAAGCT CTCGGCGGAC CCGGATTTCG AGGCCAAGGT GGAGGACGTC
GTCGGCCTCT ACCTTGATCC CCCCGAGGGC GCCGAGGTCC TGTCGATCGA CGAAAAGCCT
GGGGTGCAGG CACGCGACCG GACGCAGCCA CCGCGGCCGG TCGCCTCCGG CCGGGTCGCC
ACCCGCACGC ACGACTACCA GCGGAAGGGC ACGACCGACC TGTTCGCCGC CCTCGACGTC
GGGACGGGGC GGGTCACCGC CAGGTGCTTC CCCAGCCACA CCAGGGCCGA TTTCCTCACG
TTCATGGACC AGGTCATCGC GGAATACGGC GGTGCGGAGC TCCATGTCGT GGTCGACAAT
CTGGCCACCC ACTACGGCCC CGACGTCGAC ACATGGCTAC GCAGACACAA GAACGTCACG
TTCCATTTCA CCCCGTCCGG CGGTTCATGG CTCAACCAGG TCGAGAACTG GTTCGGTATT
CTCACCCGGC ACGCACTCCA GCACGGGGCG TTCGTCTCGG TCCAGGACCT CGTCAACACC
ATCAACAACT ATGTCAAGAA CTGGAACTGG GACGCCCATC CGTTCGAGTG GACAGCCACC
GCAGAAGAGA TCGTAGCCAA GGTGGAGGTA CTCCACCGGG AATTCAGGAA GCTGCTCGCC
AACAACTTGT GA
 
Protein sequence
MSPPEHDDLW PTPQADSLPS GQRPIRPDPT DHDADSAGAT DRPVAGPHSH ASGTPKPRRH 
RRRRTIAIIL LVLLALFPVF DRITVSMVEK QVVKQLETAV ADTLDCNAPQ PAVRRVNIAG
FPFLTQVLLG KPRDVSLSID DLSTPGPRIS SLDANAKGIK IPIYNMITGG DGKLSVDEVR
ATVKMSYTDL NAYLAGKTGH LQVKPADGGR SLNISGTVDL PLIGSQQIDG VTTFQVRDNE
IELTPSHLTL RGAINLDFLV PLGQLIPSIP IPVGELPFEV KVESVSTGLL SRVSPGRPRE
VPGRVRARIL ALTRTTPPPE TGLSHWTSTE MARYLKRREG VSVSHTFVAQ LWRENDLQPH
RHRVFKLSAD PDFEAKVEDV VGLYLDPPEG AEVLSIDEKP GVQARDRTQP PRPVASGRVA
TRTHDYQRKG TTDLFAALDV GTGRVTARCF PSHTRADFLT FMDQVIAEYG GAELHVVVDN
LATHYGPDVD TWLRRHKNVT FHFTPSGGSW LNQVENWFGI LTRHALQHGA FVSVQDLVNT
INNYVKNWNW DAHPFEWTAT AEEIVAKVEV LHREFRKLLA NNL