Gene Franean1_3776 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3776 
Symbol 
ID5672141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4477008 
End bp4478171 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content74% 
IMG OID641242657 
Productnuclease SbcCD, D subunit 
Protein accessionYP_001508077 
Protein GI158315569 
COG category[L] Replication, recombination and repair 
COG ID[COG0420] DNA repair exonuclease 
TIGRFAM ID[TIGR00619] exonuclease SbcD 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0285977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTCC TGCACACCTC GGACTGGCAT CTCGGGAAGA CCCTCAAGGG GCGCAACCGG 
CTCGACGAGC AGCGCGCCGT GCTCGGGGAG ATCATCGGGA TCGCCCGGAA GCACGAGGTC
GACGCGGTGC TCGTCGCCGG GGACGTCTAC GACAGCGCGG CCCCGCCGGC GGCGGCCCAG
CAGCTCGCCG TGCAGGCGCT GGTCGCGTTG CGCGGCACCG GCGCCGAGGT GGTCGTGATA
GCCGGCAACC ACGACCACCA GCCGACCTTC GACGCCTACC GGCCGCTGAT GTCGGCCGCC
GGGATCACCC TGGTCGGGAC GCCGCGCACC GCCGCCGACG GCGGGGTGGT CACCTTCCGC
GCCCCCGGCA CCGGCGAACC GGTCACCGTG GCGGCGTTGC CGTTCGTCTC CCAGCGGTAC
GCCGTCCGCG CCGCCGACCT GGTGACCCAG ACGCCCGACC AGACCGCGGC CCGCTACGAC
CAGGTCGTGC GCGCCCTGAT GGAACAGTTG CGCTCCGGCT TCGACGACAA CGCGGTGAAC
CTCGTCCTGG CCCACCTGAC GGTGACGAAC GGCCTGTTCG GCGGCGGGGA GCGGATGGCC
CAGTCGATCT TCGAGTACCA CGTGTCCGCG GCGGCGTTCC CCGCGGACGC CCACTACGTG
GCGCTGGGGC ACCTGCACCG GCGGCAGACC CTCGCCGCGC CGTGCCCGGT GGTGTACTCC
GGCGCGCCGC TCGCGGTCGA CTTCGGCGAG CAGGAGAACA CGAACGTGGT GTGCCTCGTC
GAGGCGAGCC CCGGCGTCCC CGCCCAGGTC ACCGACATCG CGCTGACGGC CGGCCGGCGG
CTGCGGACGG TCCGGGGCAC CGTCGCGGAG CTTGCCGCGC AGGCCCCGGA GCTCGCCGAG
GACCTGCTGC GGGTGGTCGT GCGCGAACCG GCCCGCGCGG GGCTGCGCGA CGAGGTCCAG
GAGCTGCTGC CGAACGCGCT CGAGGTCGGG ATCGACCCGG AGCTCCGGGT GGCGCTCGGC
GGGTCGCGCC CGAGCGCGGC GGCGCTGGCG CGGCGCAGCC CGGAGGACCT GTTCCGGGAG
TTCTGCGCCG CCGAGCAGTT CGAGGATCCC CGGGTCGAGG CGCTGTTCGC CCGGCTGCAC
GACGACGTCT CCAGCGGCGA CTGA
 
Protein sequence
MKFLHTSDWH LGKTLKGRNR LDEQRAVLGE IIGIARKHEV DAVLVAGDVY DSAAPPAAAQ 
QLAVQALVAL RGTGAEVVVI AGNHDHQPTF DAYRPLMSAA GITLVGTPRT AADGGVVTFR
APGTGEPVTV AALPFVSQRY AVRAADLVTQ TPDQTAARYD QVVRALMEQL RSGFDDNAVN
LVLAHLTVTN GLFGGGERMA QSIFEYHVSA AAFPADAHYV ALGHLHRRQT LAAPCPVVYS
GAPLAVDFGE QENTNVVCLV EASPGVPAQV TDIALTAGRR LRTVRGTVAE LAAQAPELAE
DLLRVVVREP ARAGLRDEVQ ELLPNALEVG IDPELRVALG GSRPSAAALA RRSPEDLFRE
FCAAEQFEDP RVEALFARLH DDVSSGD