Gene Francci3_1565 details


Gene Information

Locus tag: Francci3_1565
Symbol:
ID: 3904797
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1877388
End bp: 1878578
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 66%
IMG OID: 637878902
Product: methyltransferase type 12
Protein accession: YP_480670
Protein GI: 86740270
COG category:
COG ID:
TIGRFAM ID:
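
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(1877388, 1878578)
print(length, protein_length_aa(length))  # prints "1191 396"
```

Under those assumptions, both computed values agree with the Gene Length and Protein Length fields of this record.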


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.663655
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.440278
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGACC AGTCCACCGC CTGCCCCGCC TGCGGCGGTT CCCGTCTCAC CTCCGTCTAT 
ACAAAGGACG ACGTCCCGTC GCACAGTTGT CTGCTGCTGG CCGACGAGGA CGAGGCCCGG
GCGTTCCCGA AGGGTGATCT GCGGATCGCC TTCTGCGAGC GCTGCGGCTT CATCATGAAC
ACGGCCTTCG ATCCGACCAA GAATCAGTAC TCCGCCCGGT ACGAGGAGAC GCAGGCATTC
TCCACCCGAT TTCAGGAGTT CGCGCGGGAC CTCGCCAAGC GCTGGACCGA CAAGTACGAT
CTGTACGGGA AGACGGTCCT GGAGATCGGG TGCGGCAAGG GTGAGTTTCT CGTCCACCTG
GTGGAGCAGG GTGCCGGCGC CGGCATCGGG ATCGACCCGG GTGTCCGGCC CGAGCGCATC
ACCAGCCCGG TTGCCGGCCG GCTGACCTGG ATCACGGACC TCTACTCCGA GCGGTATGCG
CACCTGACCG CCGATGCCGT CGTGTGCCGG CACACCCTGG AGCACATCGC GCCGGTCGGC
GACTTCATGC GGATGATCCG GGCCGCGCTC GGTGACCGGA CCGATATCCC GGTCCTCTTT
GAGCTGCCGG ACGTCCTGCG GGTGCTGCAG GAGGCGGCGT TCTGGGATGT GTACTACGAG
CACTGCTCCT ATTTCAGCGC CGGTTCGCTG GCGAGGTTGT TCCGAGCTAC CGGGTTCGAG
GTGCTCGACG TCTCCCTCGA CTATGACGAT CAGTACCTGC TGATCGAGGC GCGGCCGTCC
ACCGTTCCGG CGGCCGGTGA CCCGCTGCCG ATCGAGGACG ACCTGGCCAC CCTGCGCGTC
GGGGTACGGC ACTTCCAGCG TGAGGTGGCC ACGACGCTGA ACCGGTGGAG CGAGATGCTG
TGGCGCGGGC ACCAGCGCGG CGAAAAGGCG GCGATCTGGG GTTCGGGCTC CAAAGGTGTG
TCGTTTCTGG CGACCCTCGG CCCGGCCGCC GACCTGGTCC GCTACGCCGT CGACATCAAC
CCGCACAAAC ACGGCATGTT CATGGCGGGC AGCGGCCACC GTATCGTCCC GTCCGAGTGG
CTGCGGGAAG ATAGGCCGGA TCTTCTGATC ATCATGAATC CGATCTATCG TGACGAGATC
GCGGGGGAGT TGACCCGGCT GGGCGTCGAC ACCGAGCTGA GGGCCGTCTG A
 
Protein sequence
MTDQSTACPA CGGSRLTSVY TKDDVPSHSC LLLADEDEAR AFPKGDLRIA FCERCGFIMN 
TAFDPTKNQY SARYEETQAF STRFQEFARD LAKRWTDKYD LYGKTVLEIG CGKGEFLVHL
VEQGAGAGIG IDPGVRPERI TSPVAGRLTW ITDLYSERYA HLTADAVVCR HTLEHIAPVG
DFMRMIRAAL GDRTDIPVLF ELPDVLRVLQ EAAFWDVYYE HCSYFSAGSL ARLFRATGFE
VLDVSLDYDD QYLLIEARPS TVPAAGDPLP IEDDLATLRV GVRHFQREVA TTLNRWSEML
WRGHQRGEKA AIWGSGSKGV SFLATLGPAA DLVRYAVDIN PHKHGMFMAG SGHRIVPSEW
LREDRPDLLI IMNPIYRDEI AGELTRLGVD TELRAV
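
As an illustration of how the gene and protein sequences relate, here is a minimal sketch that translates a nucleotide string using the codon assignments of translation table 11 and computes a GC fraction. It is run on only the first 60 bases of the gene sequence; the helper names `translate` and `gc_content` are hypothetical, and the 10-base grouping spaces are stripped before processing.

```python
BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11, codons in TCAG order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove whitespace (the 10-base grouping) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record.
first_60 = "ATGACTGACC AGTCCACCGC CTGCCCCGCC TGCGGCGGTT CCCGTCTCAC CTCCGTCTAT"
print(translate(first_60))  # prints "MTDQSTACPACGGSRLTSVY"
print(f"{gc_content(first_60):.2f}")
```

The translated fragment matches the first 20 residues of the protein sequence listed in this record. Passing the full gene sequence to the same functions would be the natural way to check the whole record.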