Gene Franean1_1813 details


Gene Information

Locus tag: Franean1_1813
Symbol:
ID: 5670215
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2176173
End bp: 2177984
Gene Length: 1812 bp
Protein Length: 603 aa
Translation table: 11
GC content: 76%
IMG OID: 641240734
Product: hypothetical protein
Protein accession: YP_001506157
Protein GI: 158313649
COG category: [L] Replication, recombination and repair
COG ID: [COG0322] Nuclease subunit of the excinuclease complex
        [COG2176] DNA polymerase III, alpha subunit (gram-positive type)
TIGRFAM ID: [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family
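
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The Python sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2176173, 2177984)
print(length, protein_length(length))  # prints: 1812 603
```

Both values agree with the Gene Length and Protein Length fields of the record.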


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0953081
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0279118
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTGCCCAG TCCCTGCGCC GACCGGCGGC GTGCCGCGCC AGGCCAGCCT GGCAGAGCTC 
GGCCGGCCGC TGTCGGACCT GACGTTCGTC GTCGTCGACC TGGAGACCAC CGGCGGCTCG
CCGGCGACGA GCGAGATCAC CGAGATCGGC GCCGTGCGGG TTCGGGGCGG CCAGATCCTC
GGTGAGATGT CGAGCCTGGT CCGGCCGTCG GCCCCCATCC CGGCCTTCAT CTCCGTGCTC
ACCGGCATCA CCGGCGCCAT GGTCGCCACC GCCCCGGGCA TCGGCGAGGT CGTGCCGACG
TTCCTGGAGT TCGCCCGGGG CGCGGTGCTC GTGGCTCACA ACGCCCCGTT CGACCTCGGC
TTCCTGCGCG CCGCCGCCAC GGCGTGCGGC TACCCGGCCC CGGCCTGGGA ACATCTCGAC
ACGGTGCGGA TCGCGCGCCG CGTCATCAGC CGCGACGAGA CCCGCGACTG CCGCCTGTCC
TCCCTGGCGG CCCTCTTCGG TAGCGCGACC CAGCCGAACC ACCGGGCGCT GGCCGATGCC
CGCGCGACGG TCGACGTCCT GCACGGGCTG TTCGAGCGGC TGGGCAACCT GGGTGTCACC
ACGATCGAGG ATCTGCACGA GTACAGCGCG CGGGTCTCCC CCGCCCAGCG GCGCAAGCGG
CACCTCGCCG ACGACCTGCC CACCGGCCCG GGGGTCTACG TGTTCCGCGA CGGCACGGGG
CGCCCGCTGT ACGTCGGCAC GTCCCGGTCG GTCCGCTCCC GGGTGCGTAC CTACTTCACG
GCCAGCGAGC CCCGCACCCG GATGGCGGAG ATGGTCGCGA TCGCCGAGCG GGTCGACGCG
ATCGAGTGCG CGCACGCGCT CGAGGCGGAG GTGCGCGAGC TGCGGCTGAT CGCCGAGTAC
AAACCGCCGT ACAACCGCCG CTCCCGGTTC CCCGAGCGGG CCGTCTACCT GCGGCTCACC
GACGAGCCGT TCCCCCGGCT CTCCCGGGTC CGCTCCGTCG GCGACGGTGT GACGTCGCTG
GGGCCGTTCG GCAGCGCGGC GGCGGCCGAG TCGGCGGCCA CGGCGCTGCT GGAGGCGATC
CCGCTGCGCC AGTGCTCGAC CCGGCTCTCG CCGCGCCGTC CGACGGCCGC GTGCGCGCTG
GCCGAGCTGG GGCGCTGCGG CGCGCCGTGC GACGGCCGGG AGGGGGTCGC CGAGTACGGC
CAACACGTCG CCACGGCGCG CGGCGCGATG ACCGCCGATC CGCGTCCCGT CGTGGACGTG
CTGGAGCGGC GCATCGCGCG GCTGTCCGCC GACCAGCGCT ATGAGGAGGC CGCCGGGGTC
CGCGACCGGC TCGCGGCCTA CGTGCGGGCC GTCGCGCGCA TGCAGCGGCT GACGGCGCTG
ACCTGCATCG ACGAGTTGGT CGCCGCCGCG CCGACCGCCG ACGCCGGGTG GGATCTCGCC
GTCGTCCGCC GTGGCCGGCT GGTGTCCGCG GCGTCGGTGC CGCGCGGCAC CGACCCGAGG
CCCTGGGTCG ACGCCGTGGT CGCCAGCGCG GAGACCGTCC GACCACTGCC CGGCCCCACC
CCGTGCGCCT CGGTCGAGGA GACGGAACGG ATCGGGCGGT GGCTGGCCGG GCCCGGCGTG
CGCCTGGTCC GGCTGGACGG CGAGTGGAGC TGGCCGGCGC ACGGCGCGAT CCGCGCGGCG
CGGCGGTTCG ACGTCCGCTT CGACGGCGGT TTGGACGGCG GGTTCGACCG CGGGTTCGAC
TCCCCCACCG ACACCCGACG CGGGCGCGCG CCCAGCAACC CCCGGAGCGG CCGCGAGCCG
CGGAAACGCT AG
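
The GC content field can be recomputed from a sequence printed in this layout. The sketch below is a minimal example, assuming the sequence is given as blocks of bases separated by spaces and line breaks, as shown above. The helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, copied from the record above.
first_line = "GTGTGCCCAG TCCCTGCGCC GACCGGCGGC GTGCCGCGCC AGGCCAGCCT GGCAGAGCTC"
print(round(gc_content(first_line), 1))
```

Passing the full gene sequence instead of the first line gives a value that can be compared with the GC content field.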
 
Protein sequence
MCPVPAPTGG VPRQASLAEL GRPLSDLTFV VVDLETTGGS PATSEITEIG AVRVRGGQIL 
GEMSSLVRPS APIPAFISVL TGITGAMVAT APGIGEVVPT FLEFARGAVL VAHNAPFDLG
FLRAAATACG YPAPAWEHLD TVRIARRVIS RDETRDCRLS SLAALFGSAT QPNHRALADA
RATVDVLHGL FERLGNLGVT TIEDLHEYSA RVSPAQRRKR HLADDLPTGP GVYVFRDGTG
RPLYVGTSRS VRSRVRTYFT ASEPRTRMAE MVAIAERVDA IECAHALEAE VRELRLIAEY
KPPYNRRSRF PERAVYLRLT DEPFPRLSRV RSVGDGVTSL GPFGSAAAAE SAATALLEAI
PLRQCSTRLS PRRPTAACAL AELGRCGAPC DGREGVAEYG QHVATARGAM TADPRPVVDV
LERRIARLSA DQRYEEAAGV RDRLAAYVRA VARMQRLTAL TCIDELVAAA PTADAGWDLA
VVRRGRLVSA ASVPRGTDPR PWVDAVVASA ETVRPLPGPT PCASVEETER IGRWLAGPGV
RLVRLDGEWS WPAHGAIRAA RRFDVRFDGG LDGGFDRGFD SPTDTRRGRA PSNPRSGREP
RKR
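
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. The sketch below assumes the standard codon assignments, which translation table 11 shares, and treats an alternative start codon such as GTG as methionine when it opens the reading frame. The `START_CODONS` set is a deliberately small subset chosen for illustration, and all names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}
# Assumption: a subset of the start codons allowed in bacterial translation.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a CDS up to the first stop codon, ignoring whitespace."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiating start codon is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGTGCCCAG TCCCTGCGCC GACCGGCGGC"))  # prints: MCPVPAPTGG
```

The output matches the first ten residues of the protein sequence, and translating the final twelve bases (CGGAAACGCTAG) yields RKR followed by the stop codon.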