Gene Francci3_3795 details


Gene Information

Locus tag: Francci3_3795
Symbol:
ID: 3906080
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4547504
End bp: 4548850
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 72%
IMG OID: 637881121
Product: peptidase M16-like
Protein accession: YP_482874
Protein GI: 86742474
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
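
The start and end coordinates, the gene length, and the protein length listed in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention) and that the final codon of the CDS is a stop codon. All variable names are illustrative and do not come from the database.

```python
# Values copied from the record above.
start_bp = 4547504
end_bp = 4548850

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1347, matching "Gene Length"
print(protein_length)  # 448, matching "Protein Length"
```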


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.417189
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGCA GCGCGCTGAG CTCGACGACC ACCCGGGCGG CGGTCATCCC GGCCGTGCGC 
CCGACCGTCC CGGCGGAGTT GCCGGCCGTC ACCGACCACA CTCTCGACAA CGGTCTGCGG
GTGCTGGTGG TGGAGCGGGC CAGTGTGCCC CTGGTCGAGG TGCGGCTGCG GATCCCCTTC
GCCGGCGTCG GCGCCGTGCA CCAGGCCCGG GCCGAGGTGC TCGCGGAGAC GCTGTTCACC
GGATCACACC GCTTCGACCG GGTGGGCCTG GCCACCGAGG TCCAGCGCCT GGGCGGATCG
TTGTCGACGG GGGTTGACGC CGACCGGCTG GCCATCGTCG GTTCCGCCCT TGCGGTGAAC
CTCGAACCCT TGTTGGGAAT CATGGCCGAG GTGCTCCTCA GCGCCACCTA CCCGGACGAC
GAGGTCACCG GCGAACGCGA CCGGATCGTC GAGGACACGG CGATCGCCTG TAGCCAGCCC
GCGGTCATCG CCCGGGAGGC GCTGCTCGGC CGGCTCTTCG GTGACCACCC CTACGCCACC
GGCATCGCCG AGCCGGAGAC CGTCGGACAG GTCGGCCCCG AGGACGTACG GGCCCTGCAC
GCCGAACTGA TCTCCCCGGC CGGCGCGATC CTCACCCTGG TCGGGGACGT GCCGGCGCCG
CGTGCTCTCG CGGCGGTCTC CGCCGCGCTC GGCGGTTGGA CGGGCGGGCC CGCGCGGACC
GTCCCGCCGG TCCCAGCCCT CACCACCGGC CCGATCGTCA TCGTCGACCG TCCGGGTGCC
GTGCAGACCA ACATCAGGCT GGGCGGACCG GCCCTGGGCC GTTCCGCGGC CGGATACCCG
GCGCAGCGGC TGGCGAGCAC CATCTTCGGT GGTTACTTCA GTTCCCGGCT GGTCAACAAC
ATCCGCGAGG ACAAGGGCTA CACCTACTCC CCGCGCAGCT CGATCGATCA CTACCAGGCA
GGGTCACGGT TCACCGTCGC AGCGGACGTC GCCACCGAGG TGACCGGGCC CGCGTTGCTG
GAGATCTTCT ACGAGCTCGG GCGGATGGCG GTTCTACCGC CCAGCGAGGA GGAGCTCGAC
GCCGCCCGCC AATACGCCGT TGGCACGTTG GCACTGGGCA GCGCGACCGC CGCCGGTCTC
GCCTCGACGC TGTCGGCGCT GGCCGGGGCC GGGATCGGGG TCGAGTACCT GCGTGATCAC
CCGCGCGCCC TCGCCGAGGT CGGCGTGGGC GACATCCAGG CGGTGTCCGC GGACCTGCTT
GCCCCGGCAA AGCTGATCAC GGTGCTCGTC GGGGACGCGG CCAGGATTTC CGCGACCGTG
GGGGCACTGG GCATCGTCGC GACCTGA
 
Protein sequence
MSGSALSSTT TRAAVIPAVR PTVPAELPAV TDHTLDNGLR VLVVERASVP LVEVRLRIPF 
AGVGAVHQAR AEVLAETLFT GSHRFDRVGL ATEVQRLGGS LSTGVDADRL AIVGSALAVN
LEPLLGIMAE VLLSATYPDD EVTGERDRIV EDTAIACSQP AVIAREALLG RLFGDHPYAT
GIAEPETVGQ VGPEDVRALH AELISPAGAI LTLVGDVPAP RALAAVSAAL GGWTGGPART
VPPVPALTTG PIVIVDRPGA VQTNIRLGGP ALGRSAAGYP AQRLASTIFG GYFSSRLVNN
IREDKGYTYS PRSSIDHYQA GSRFTVAADV ATEVTGPALL EIFYELGRMA VLPPSEEELD
AARQYAVGTL ALGSATAAGL ASTLSALAGA GIGVEYLRDH PRALAEVGVG DIQAVSADLL
APAKLITVLV GDAARISATV GALGIVAT
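
The GC content and the protein sequence can in principle be recomputed from the gene sequence. The sketch below is one way to do that in plain Python, under stated assumptions: the helper names (clean, gc_content, translate) are hypothetical, the codon table is the standard one (translation table 11 uses the same codon assignments as the standard code and differs only in the permitted start codons), and only the first line of the gene sequence is used as sample input. To check the whole record, paste the full sequence in its place.

```python
def clean(seq):
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq.split()).upper()

def gc_content(seq):
    """Return the percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)

# Standard codon table, built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence above, kept in its 10-base blocks.
first_line = ("ATGAGCGGCA GCGCGCTGAG CTCGACGACC "
              "ACCCGGGCGG CGGTCATCCC GGCCGTGCGC")
dna = clean(first_line)

print(translate(dna))  # MSGSALSSTTTRAAVIPAVR, the first 20 residues of the protein
print(gc_content(dna)) # GC percentage of this 60-base fragment only
```

Cleaning the input first matters because the sequence is displayed in space-separated blocks of ten bases, which would otherwise shift the reading frame.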