Gene Franean1_2815 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2815 
Symbol 
ID5671204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3331236 
End bp3333365 
Gene Length2130 bp 
Protein Length709 aa 
Translation table11 
GC content72% 
IMG OID641241724 
Productphage integrase family protein 
Protein accessionYP_001507144 
Protein GI158314636 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.952182 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCG CGGCGCACCC GCAGGCGCAC CCCCAGGATG CCGTCGGCTG GCCGGACGAC 
GACACGATCG TGCTGGCCGG CCGGCCAGTG CGCCCCAGCA CCGACGAGAC GCTGCTGTCC
CGGTTCGGTG ACCTGGTCTG GCAGCTGAGC CCGGCACATC CCGACGCCCA CATCACCGTC
GCCGCGCTGG ACTGGCGGCG CTACCCGCCC CAGCTCGTCC GGCCGTTCAA GACGTTCTTC
CACACCGCAC TGGAGCAGCC CTACCCGGCG TCGCCGAACG TGCAGCGGCC CGGCGAGCGG
CCCAGCGTCG CCACGCTCAG CTACTGGTTC GTCGACCTGC TCGTCTTCGC CACCTGGCTC
GACGAACGCG CCGTCGGCCA CCTCAGCGAC ATCACCACCG CCGACCTAGA CGCCTACCGG
TCCCACGTCC TCGGGCTGGG CCGAAGCCCC GCCCGCGAGA CGGACCTCCT CGCCGCCGTC
CGCACCCTGT GGATCTACCG TGACCGTCTT CCGGCGGCGT GCCGGCTTCG GGTCTGCCCG
TGGGGCGGCC TGGCGGCCAA GGACCTGGTC CGCCTCCCAC CCGCCGGCCG GGAGAACGCC
ACCCCCCGGA TCGCCTCCGC GACGATGGAC GCCCTGCTCG CCTGGGCGCT ACGCATGGTC
GAAACCCTCG GCCCGGACAT CCGCGACGCC TGGCACGAGT TCCACGACCT CGACGCGGGC
AACCACCCCT CCCAGCAGAT CTACCAGGGG ATGGGCATCC CCGACCGCCT GCACCTGTTC
CTCCACCACG CCAGCAGACG CGGAACCTTG CTGCCCGGAC GGCACGACCC CGCCCGCGGC
ACCGCCGTCA ACGGCAGTCA CATCCTGCGC CTGGTCGGGG TCCCCCCGGA CAAGCGCGCC
GGGCTGCCCT CACGGCAGCG GGCGCTGCTG GAGAACGCGG GCGTACCGAT CAGCACCGAC
ACCACCGTCG GCAGGATCAC CGCACGCCTC GACGGCATCC CCTGGCGGCC CGGCCCGATC
AGCATCCGCG AGCTCCCCAC CCTGGTGCGG CTGCTCTATG CTTCCGCCTT CACCGTCATC
TGCTATCTAT CGGGCATGCG CCCCGGCGAG GTCCTCACCC TGCCCCACGG CTGCGCCGGC
AGCGACCCCC GCACCGACGA GCTGCTGCTG CACGGCCGGC GGGGCAAGGG CTACGACCGA
AGCCCCCTGA CCCCCGGGCA GGTCGAACCC GACCGACCCT GGGTAGTCGT CGCCCCGGTC
CACACCGCCG TGCGGATGCT GGAAAGCCTC GCCGACTTCC CGTTCCTGTT CCCCGCCAGC
CCGATCGCCG CCCACGCCGG CCGGGCCAAC ACCACCCACG CCCGCTCCAC CGCCGCGATC
AACCAGGACC TGGAAGACCT CGTCACCTGG GTCAACACGA CCTTCACCCG CCCGGACGGC
ACCCCGCCCA TTCCACCCGA CCCGACCAAG CACCTCCACG CCACCCGCTT CCGGCGCACC
CTCGCCCACA GCATCGTCCG CCGTCCCCGC GGCCTCATCG CCGCCGCCCT GCAATACGGA
CACGTCCGCG CCAAGGTCAC CCTGAGCTAC GCGGGCGCCG CGGACACCTC CTGGCTCGAT
GACCTGGCGG TCGAGCGCCT GGAGATGGTC ATGGAACAGA CGCAGACCGA TGCCCGGCTC
CTCGCCGACG GCGAGCACGT CAGCGGACCC GCCGCCACCG ACTACCGCAC CCGGATCGCT
CGGTTCCACG GCCGAGTCGT CAACCAGCCC CACAACGCCC GACGGCTCCT CGCCAGCACG
GACCCAGACA TCCACCACGG CGACGGCCTC ACCTGCGTCT ACCGCGCCGA GACCGCCGAA
TGCCGCCGCA TCCTCGCCCG ACAGGGGATC ACCGTCGACG GGCCGCAAGA GTCCCACTGC
CGGTCGACCT GCCGCAACCT CGCCTACACC GACCGCAGCA TTGACCAGCT GCGCTCCCGG
CTCGATCTCC TGGTCGCCAC CACCGGCGAC TCCCTGACGC CCCAGCCGCT CCGCGACCGA
GCGCACGCAC AGGCCCAGGC CGCCAGAGCC ACCATCGACC GGCACGTCTC ATCCTCCCCA
CATCGGGCAG ACCAGGCAGG CCAGCGATGA
 
Protein sequence
MTTAAHPQAH PQDAVGWPDD DTIVLAGRPV RPSTDETLLS RFGDLVWQLS PAHPDAHITV 
AALDWRRYPP QLVRPFKTFF HTALEQPYPA SPNVQRPGER PSVATLSYWF VDLLVFATWL
DERAVGHLSD ITTADLDAYR SHVLGLGRSP ARETDLLAAV RTLWIYRDRL PAACRLRVCP
WGGLAAKDLV RLPPAGRENA TPRIASATMD ALLAWALRMV ETLGPDIRDA WHEFHDLDAG
NHPSQQIYQG MGIPDRLHLF LHHASRRGTL LPGRHDPARG TAVNGSHILR LVGVPPDKRA
GLPSRQRALL ENAGVPISTD TTVGRITARL DGIPWRPGPI SIRELPTLVR LLYASAFTVI
CYLSGMRPGE VLTLPHGCAG SDPRTDELLL HGRRGKGYDR SPLTPGQVEP DRPWVVVAPV
HTAVRMLESL ADFPFLFPAS PIAAHAGRAN TTHARSTAAI NQDLEDLVTW VNTTFTRPDG
TPPIPPDPTK HLHATRFRRT LAHSIVRRPR GLIAAALQYG HVRAKVTLSY AGAADTSWLD
DLAVERLEMV MEQTQTDARL LADGEHVSGP AATDYRTRIA RFHGRVVNQP HNARRLLAST
DPDIHHGDGL TCVYRAETAE CRRILARQGI TVDGPQESHC RSTCRNLAYT DRSIDQLRSR
LDLLVATTGD SLTPQPLRDR AHAQAQAARA TIDRHVSSSP HRADQAGQR