Gene Francci3_1167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1167 
Symbol 
ID3905278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1392902 
End bp1394719 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content75% 
IMG OID637878499 
Productglycosyl transferase, group 1 
Protein accessionYP_480275 
Protein GI86739875 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.290628 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGCCCC CGGGAGGATC GACACACCTC GGAGGGTCCC GGGAGCGCCG GTACCTGCCC 
ACCCTCGCCG GCCGGCACGT GGTGTTCCTC AACTGGCGGG ACCGGGAGCA TCCGCAGGCC
GGCGGGGCGG AGTTGTTCTG CCAGTCGATC GCCGAGCGGT TCGCCGCGGC CGGTGCCCGG
GTCACCCTGC TGACCTCGCG GGCCACGGAA GGTGAACGCC CTGGTCCGCC CGTCGCCGAG
TCCGTTGGCG GCGTCGAGGT GCGCCGCGGG GGCGGTACCT TCGGCGTCTA TCCATCGGTG
CTCGCCCGGC TGGCCAGCCT GGGCCGGGCC GGGCACCGCG TCGACGCCGT CGTCGACTGC
CAGAACGGGA TCCCGTTCTT CAGCCCCCTG GTGCTGCCGG CGAGCACCCC CGTCGTGCAG
GTGCTCCACC ACGTTCATCA GAAACAGTTT CCGTTGTACT TCCCCGGGCC GGTCGCCCGG
GTGGGGCAAC TGCTGGAGGC GCCGGGCAGC CGGTGGGTCT ACCGGCGCCG GCCGGTGGTG
GTCGTCTCCC CCTCGACCCG CGCCGAGGCC CGTGACATCC TCGGCCTGCC CGGCCCGCGC
TTCCTGGTCC CCAACGGGGT GACCACCTCG GCCGCCGGGG CCGCCGGGGC CGCCGGGGCC
ACTACGTTCA CGGGCCCGGC TGGCGGTGGG CCCGCAGGCT CGGATGGCGG CCCCGCGACG
GCACCCACCA TCGTGTGCGT CGGCCGGCTC GTGCCGCACA AGCGGCTCCA CCTGCTCCTC
GACGCGCTGC CCGCGCTGGT CGGGGCGCAT CCCGGGCTCA CCGTCCACAT CGTCGGCGAC
GGCCCGGACC GCGAACGGCT GACCGCCCGC GCGGTCGCCC TCGGGCTGAC CACTACCGGT
GCGGTGACCA CTACCGACCC GGGGGCGGCT CCGGGCTCGG GAGGCACGAC CGACCGGGAC
GGTGACGCCG TGCGCTGGCA CGGCTACACC GATGCCGCCA CCCGCGACCG GCTGCTTTCC
TCGGCCTGGC TGACAGTCAA CCCCTCGCAT GGTGAGGGCT GGGGGCTGTC GGTCCTGGAG
GCCGCTGCCC TCGGGGTGCC CGCGGTGGCG TTCCGGGTCC CGGGGCTGCG CGACGCGGTC
CGGGACGGGA CTACGGGATG GCTGGTCGAC GAGGGCGAGC CGCTGGAGAA GACCCTCGAC
GCGGCGCTAA GGCTACTGGC CGAGCCGCCC GCGGCGGCCG AGCTGCGCGC GGCGGCCCGC
GCCTGGGCGG CCGGGTTCAC CTGGGACGCC AGCGCCGAAC TGCTGGCCCG GGTGGTCACC
GCCGAGATCG ACCGACTGGC CGACCCGGCG GCGAAGCGGG CCGAAGCAGG AGAACAAGGC
CGCGCGCCGC AGCGGGGTCG GGGACCGGAT CGGGGCCTGG AGCGCCGGCG CCGGGACGAT
GACCTCACGC ACGCCCGTTT CACCCTGCCG GTCGGCACCT CCCGCCCGGC GTTGCGCCGC
ACGGACCTGG TGTACCGGCC GGATCCGACC GGCCCGGAGA TGATCGCGCT CCTCTACGGC
GCCGGCCCCG CCGCCGCACG CACCGCCCTG ACCCGGGCCG GAGTCGGGGT CGGCGCGCTC
GCCGACCTGC GGCTACGCCC GGCGACCGGC GAGGACCTGC TGTTCGCGGC CAGCCGTGAC
CTGGCGGCAT CTCCCCCGGA TCCGGGTCCC GGTCCGATCC CGACCGACAA CCGGGGCGAT
CAGGCGAAGG CCGGAGCGCG ACGAGCAGCC TCGAACATGC TTTCCACGGA GCTAGCGGGA
AAACCAGGTG ACCGTTGA
 
Protein sequence
MTPPGGSTHL GGSRERRYLP TLAGRHVVFL NWRDREHPQA GGAELFCQSI AERFAAAGAR 
VTLLTSRATE GERPGPPVAE SVGGVEVRRG GGTFGVYPSV LARLASLGRA GHRVDAVVDC
QNGIPFFSPL VLPASTPVVQ VLHHVHQKQF PLYFPGPVAR VGQLLEAPGS RWVYRRRPVV
VVSPSTRAEA RDILGLPGPR FLVPNGVTTS AAGAAGAAGA TTFTGPAGGG PAGSDGGPAT
APTIVCVGRL VPHKRLHLLL DALPALVGAH PGLTVHIVGD GPDRERLTAR AVALGLTTTG
AVTTTDPGAA PGSGGTTDRD GDAVRWHGYT DAATRDRLLS SAWLTVNPSH GEGWGLSVLE
AAALGVPAVA FRVPGLRDAV RDGTTGWLVD EGEPLEKTLD AALRLLAEPP AAAELRAAAR
AWAAGFTWDA SAELLARVVT AEIDRLADPA AKRAEAGEQG RAPQRGRGPD RGLERRRRDD
DLTHARFTLP VGTSRPALRR TDLVYRPDPT GPEMIALLYG AGPAAARTAL TRAGVGVGAL
ADLRLRPATG EDLLFAASRD LAASPPDPGP GPIPTDNRGD QAKAGARRAA SNMLSTELAG
KPGDR