Gene Francci3_1331 details


Gene Information

Locus tag: Francci3_1331
Symbol:
ID: 3906603
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1597632
End bp: 1598915
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 74%
IMG OID: 637878664
Product: 3'-5' exonuclease
Protein accession: YP_480437
Protein GI: 86740037
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0349] Ribonuclease D
TIGRFAM ID: [TIGR01388] ribonuclease D
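
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon, which is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1597632
end_bp = 1598915
gene_length_bp = 1284
protein_length_aa = 427

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not encode a residue.
codons = span // 3
residues = codons - 1

print(span, span % 3, residues)  # prints "1284 0 427"
```

Under these assumptions the span equals the stated gene length, is a whole number of codons, and yields the stated protein length.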


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0588384
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.960366
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAGGTAC AGCCCAGCAC CGGTGCTCAT GAGCCCATCC CGCTGCTCAT CCCTGCGGAC 
GGGGTACCCC CCGTCATCGT TGACACCGCG GAACTCGCCG CGGCGGCGGA ACGCCTCGCG
GCCGGCATCG GGCCGGTCGC GTTCGACGCC GAACGTGCGT CCGGTTACCG GTACAGCCAG
CGGGCCTACC TCGTCCAGAT CCGACGGCGG GGCACGGGCT CGCTGCTCCT CGATCCGATC
GCGCTCGAGG ATCTGAGCGT CATCCAGGAC GCGGTGGGCG GGGTCGAATG GGTGCTGCAC
GCCGCCAGCC AGGACCTTCC GTGCCTGTCC GAGCTCGGTC TGCGGCCGAG TCTGCTGTTC
GACACGGAGC TCGCCGGCCG GTTGCTGGGC TACGAACGGG TCGGGCTGGG GATAATGGTC
GAACGGGTGC TGGGCTACGG GCTGGAGAAG GGCCATTCGG CCGCGGACTG GTCGACCCGC
CCGCTTCCCG AGCCGTGGCT GCGCTACGCG GCACTCGACG TCGAGCTGCT GGTGGAGTTG
CGCGACGCGC TCGAGGCGGA GCTGATCGAA CAGAACAAGA TCGAGTTTGC CCGGCAGGAG
TTCGCCGCGA TCGTCGCGGC CCCGCCACGC GAGCCACGGG CCGAGCCGTG GCGCCGGACG
AGCGGGATCC ACCGAGCCCG GTCCCGCCGC CAGCTGGCGG CGGTCCGGGC GATGTGGACC
GCCCGGGACC GGCTAGCTCG TACCCGTGAC GTCGCGCCGG GTCGCGTCCT CCCGGACAGC
GCGATCATGG ACGCCGTGCT GAACGCCCCC ACCGACGCGG CCGCGCTGGT CCGGCTGCCC
ATCTTCTCCG GACCACGGAT CCGCCGCACC GCGAACGTCT GGCTCGACGC CCTGCGTGGC
GCCGCCGCAC TGCCCGAGGA GGAGCTGCCC GCCCCGGCCG GACCGGGCGG CGACGGTCTG
CCGCCGCCCA ACCGCTGGGC CGAGCGCGAC CCGGTGGCCG CGTCGCGGCT GGCCCGGGTG
CGGGCCGCGC TGGCCGCCCT CGCCGCCGCC CACACGATGC CGGTGGAAAA CCTCCTCGAG
CCCGCGTTGT CCCGGCGGCT CGCCTGGTCC CCGCCGAATC CCCTGACGGA CACGGCCGTC
GCCGGGGCGC TGCGGACCGG CGGGGCACGC CCCTGGCAGA TCAAGCTGAC GGCCGCTCCC
CTGCTCACCG TCCTCGCCGA GCCGCCGGCC GAACCGACGG AACCGGCCGA ACCGGCCGAA
CCGGCAGCCG AGCCGGCGGG CTGA
 
Protein sequence
MEVQPSTGAH EPIPLLIPAD GVPPVIVDTA ELAAAAERLA AGIGPVAFDA ERASGYRYSQ 
RAYLVQIRRR GTGSLLLDPI ALEDLSVIQD AVGGVEWVLH AASQDLPCLS ELGLRPSLLF
DTELAGRLLG YERVGLGIMV ERVLGYGLEK GHSAADWSTR PLPEPWLRYA ALDVELLVEL
RDALEAELIE QNKIEFARQE FAAIVAAPPR EPRAEPWRRT SGIHRARSRR QLAAVRAMWT
ARDRLARTRD VAPGRVLPDS AIMDAVLNAP TDAAALVRLP IFSGPRIRRT ANVWLDALRG
AAALPEEELP APAGPGGDGL PPPNRWAERD PVAASRLARV RAALAALAAA HTMPVENLLE
PALSRRLAWS PPNPLTDTAV AGALRTGGAR PWQIKLTAAP LLTVLAEPPA EPTEPAEPAE
PAAEPAG
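
The gene and protein sequences can be related with a short script. This is a minimal sketch, not the database's own method. It assumes the amino-acid assignments and start codons of NCBI translation table 11, in which GTG can act as an initiator and is then read as methionine. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. Only the first row of the gene sequence is used here; the full sequence can be pasted in the same way.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; table 11 uses the same
# amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon in first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence shown in this record.
first_row = "GTGGAGGTAC AGCCCAGCAC CGGTGCTCAT GAGCCCATCC CGCTGCTCAT CCCTGCGGAC"
print(translate(first_row))  # prints "MEVQPSTGAHEPIPLLIPAD"
print(round(gc_percent(first_row), 1))
```

The translated residues match the first 20 residues of the protein sequence listed in this record. Applying `gc_percent` to the full gene sequence gives a value to compare with the GC content field.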