Gene Francci3_4034 details

Gene Information

Locus tag: Francci3_4034
Symbol:
ID: 3906995
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4819024
End bp: 4820157
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 70%
IMG OID: 637881363
Product: putative cyclase
Protein accession: YP_483113
Protein GI: 86742713
COG category: [R] General function prediction only
COG ID: [COG1878] Predicted metal-dependent hydrolase
TIGRFAM ID:
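
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(4819024, 4820157)
print(length, protein_length_aa(length))
```

With the values in this record, the computed lengths agree with the listed 1134 bp and 377 aa.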


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.368561
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGTCCT CGGGGAGTGT TGCCTCCTGG GATCCTGGGC ATGCCTGGCC CCCTCGGCCC 
GCCTTCGGCC GCCCCGGGCT CCGGGGGCGG ACACCGGATG GAGATCATCG TCTCGTGAGC
ACCGAGCTGA TCGATCTGTC GGTACCCATC GTGACCGGCA TGCCCGTCTA CCCGGGGGAT
CCCGAGGTCG ACGTGTCACC GGCACTGACC ACGGCCGAGT CCGGCGTCAA CGTGCAGCGC
CTGCACCTGG GATCACAGAC GGGCACGCAT GTCGACGCCC CCTTCCACAT CGACGACTCG
CTGCCCAGGC TTGACGAGGT TCCGCTTCAA CGTTTCACCG GACCCGCGGT TCTGCTGGAC
GCGCGGGGCT ACGGTCCACA GGCGGCGATC GGCCCGGAGG TGCTATCAGG CCCCTTTACC
CGCCCGTTTC CTCACGATGT CGTCGTCCTC ATCGTCACAG GATGGTCCGT CCACTGGGGC
CACGACGGGT ACCTACGACA TCCCTATCTC GCGCCGGACA CCGCGCGGGC CCTCGTCAAC
GCGGGTGTCC GGACGGTCGG TATCGATGCG CTCAGCGTTG ATCGAACACC CGGACCGGGC
CAGGATGTCA GTCTCGCGGC CCACCGGATT CTGGGTGGGG CCGGTGCCGT CATCGTAGAG
AACCTGACCA GTCTCGATCG ACTTCTCACC GCCCGGGCGA ACGGACGGCC CATCCACATC
GTCCTGTTCC CGATTCCGCT CGCCGGAGCC GACGGTGCCC CGGTCCGGGC AGTCGCAAGC
GTCGGCCCCG CGGATCAGCT CCCCACCGCG TGCCGTCGCA GCAGCGGGTC GAGTTCGCCG
GACGTGTCCC CCACGACGTT CCCGCCCGCC GGTGACGCAT CGGCCGTCGG CCCGGCCGGC
GTCGCCGTAC CCGAGGGCAT GATCCCGGTG GGAGCCACCA GCCCACCGTG CGTCCCCGGG
GGGTTCACCT GTACCCGGCC GGGACGTGAG GTGGTCGAGC GCGTGCCCAC GCGGGGCGCG
GACCGGATGG CCGGCCGGGC CGAGGCCACG TCCACACCGA CGACACCGTC CACACCGACA
ACACCGTCCA CACCGACAAC ACCGTCCACA CCGTCCACAC CGGTCTCGCC GTAG
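
A GC content figure like the one in the gene information can be recomputed from the sequence text. The record does not say how the database rounds the value or handles ambiguous bases, so this is only a sketch under the assumption that GC content is the share of G and C among unambiguous A/C/G/T bases. The function name `gc_percent` is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Spaces, line breaks, and any other characters are ignored, so the
    10-base blocks shown in the record can be pasted in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence, as an example input.
print(round(gc_percent("GTGACGTCCT CGGGGAGTGT"), 1))
```

Passing the full gene sequence instead of the short fragment gives the value to compare with the reported GC content.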
 
Protein sequence
MTSSGSVASW DPGHAWPPRP AFGRPGLRGR TPDGDHRLVS TELIDLSVPI VTGMPVYPGD 
PEVDVSPALT TAESGVNVQR LHLGSQTGTH VDAPFHIDDS LPRLDEVPLQ RFTGPAVLLD
ARGYGPQAAI GPEVLSGPFT RPFPHDVVVL IVTGWSVHWG HDGYLRHPYL APDTARALVN
AGVRTVGIDA LSVDRTPGPG QDVSLAAHRI LGGAGAVIVE NLTSLDRLLT ARANGRPIHI
VLFPIPLAGA DGAPVRAVAS VGPADQLPTA CRRSSGSSSP DVSPTTFPPA GDASAVGPAG
VAVPEGMIPV GATSPPCVPG GFTCTRPGRE VVERVPTRGA DRMAGRAEAT STPTTPSTPT
TPSTPTTPST PSTPVSP
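
The protein sequence corresponds to the gene sequence read under translation table 11, the bacterial, archaeal, and plant plastid code. The sketch below shows that relationship on the first and last lines of the gene sequence. It assumes the input is in frame and contains only A/C/G/T, and it treats the first codon as an initiator without checking it against the table's list of start codons. `translate_cds` is an illustrative helper, not the database's own tooling.

```python
# Table 11 uses the same codon-to-amino-acid assignments as the standard
# code; codons are ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(seq: str, initiator: bool = True) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    If initiator is True, the first residue is written as M, which is how
    alternative start codons such as GTG are read at the start of a CDS.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT input
        if residue == "*":
            break
        protein.append(residue)
    if initiator and protein:
        protein[0] = "M"
    return "".join(protein)


# First line of the gene sequence (starts with the GTG initiator codon).
head = "GTGACGTCCT CGGGGAGTGT TGCCTCCTGG GATCCTGGGC ATGCCTGGCC CCCTCGGCCC"
# Last line of the gene sequence (in frame, ends with the TAG stop codon).
tail = "ACACCGTCCA CACCGACAAC ACCGTCCACA CCGTCCACAC CGGTCTCGCC GTAG"

print(translate_cds(head))                   # MTSSGSVASWDPGHAWPPRP
print(translate_cds(tail, initiator=False))  # TPSTPTTPSTPSTPVSP
```

Both outputs match the start and the end of the protein sequence listed above.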