Gene Francci3_1146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1146 
Symbol 
ID3903574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1360375 
End bp1362795 
Gene Length2421 bp 
Protein Length806 aa 
Translation table11 
GC content71% 
IMG OID637878478 
Producthypothetical protein 
Protein accessionYP_480254 
Protein GI86739854 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0503297 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCC GGCACGAGAT GGGGCAGACG GGGCCTGTCT CCGTCGAACC CGGGACAGAG 
CCGCAGGCGC CCTTCGACGT CATCGATGCG CAGGTTGTCG TGAGTCCCGG CTTGGAGAGC
GGCCTTGCGC ACGGGACACC CGGATCGGTG GTCGTCGCCG CGCCCAGGGG TGACGCCGTG
CCCAGGGGTA ACACTGGGCT CGGGGGTAAC ACTGGGCTCG GGGGTGACGC CGAGCCCCGG
AACGACACGC CGCGGTCCCG GCCGGACGGA GAACAACCGC CGGCCGGGCC GCCGGCCTCC
GACGCCGCTC GCGGTCGGCG GACTCCCGTC CAGCGGCTGC GCGTTCGTCT CGCGAGCATC
TCGGGGCCCG CTCGGTACGT GATGGTGCTC GCCGTTCTGA GCGCGGCGGT CGCCGTCGTC
CTCCGGCACA CCGTGTTCCC GTTCCTGTCG GTCAACAACG ACGAGGGCAT CTACCTCCTG
CATGCCAGGA CCCTCGCCGA GGGCCGCCTG TTCCCGCCCG CCCCGGACCC GGCACGGTCC
TACCTGCCGT GGCTGGGAAC CGTTGTCGGC GATCACTACG TGTTGAAGTA CACGCCCTTC
GTACCCGCGA TCTTCGCAAT CGGCATCGTG CTGACGGGAA GCATCGATCC GGCGCTTGCA
ATGATCGTCA TGGCGGCGGT CGTCGTCACC TACCTGCTGG GCGTCGAGCT GTTCGGTAGC
CGGCGGACCG CGGCGTTGGC GGCGACCCTG CTCGCCCTCT CCCCGCTGGT GATCCTGCAG
AGCGCGATGG TGCTGAGCTA CTTGCCGACC CTGGTGCTGC TGCAGGTGAC CGTTCTGGGG
GTGTTGCGGG GACGGCGCCG CCGACGGGCG GCACCGGTGG TCGTCGCGGG CCTCGCCCTC
GGCGTGGCGG CGGCGGTGCG GCCGTATGAC GTCGTCCTGC TCCTGGCTCC GCTCGCGATC
TGGCTGCTCC TGACCTCCAC GGGCGGGCGC TGCTGGCTGG GCCGGTGGCT GCTTTGCGGG
CTCGCCGCGC CCGTGGCGCT TGTCCTGCTC AGCAACGCCG CTGCCACCGG TGACCCGTTG
CGGCTGCCGT TCACGCTCCT TGAACCCGAC GACAAGCTCG GCTTCGGGGT GCGCAAGCTG
TATCCCGGCG ACGGCAGGCA CGACTTCGGC TTGCTCGAAG GGCTGCGGTC GGTCGGCGAC
CACCTGTGGC TGTTCGGCGG CTGGGCCTGC GGGGGAGTGG TGCTGGCCGG CTGCGCGATC
GTCGTGGCGG CTCGGCGGCA GCTGTCCGCA CCCGCCGCGC TCCTGGGTGC GGGTGGGCTG
CTCTTCCTGG GCGGATACGT CGGCTTCTGG GGCGCGTGGA ACGCGGCCGA GTTGTGGGGC
GGCATCCGCT ACGTCGGCCC TTTCTACCTG GTCCCGGTGC TCATCCCGCT CGTCCACCTC
GGCGCGGAGG GCCTGGTCAG GGCCGGCGAA GCCGTCTTCG CCCGCGGGCG AAGACTCGGG
CTCATCGCCG TCGCGGGGAC CGGCGCCGCC ATCCTGGCGC TCACGGCCGT CGTCCTGGTC
GGGGCGGTGC GCGCCAACCT CACGCTCACC GGGCACGACC GTGACCTCAG CGCGATGCTG
GACCGGCTGC CGGGCGATCC GCTCGTGTTC GTCGCGGCGA ACCCGCCGTT TCTCGGGCAT
CCCACTCCGG TGACCGCGAA CGGGCCGAAG CTCGAAGATC CTGTGCTGTT CGCCGTGTCG
CGGGGGGTCG ACGATCTCAT CGTGGCCGCT GACCACGCGG ACCGCCCGGC CTATCTGCTC
CGCCTGGCAT CGGCCTACGA CCGCTCACCC GCCTCGCCCT CCACCGCGCG GGTCGAACGG
CTGGCGGTCA GGAGCGGCAG GCGGGTGGCG GTGACCCTGA GAGTGGATGC CGCGCCCGCG
GGCGCCCGTT CCGCCCGCGT CGTGCTCACC GAGGGTGTCC GCCGCCTGTC GATTCCGGTC
CATCCGCGGA TCCCCACCTC GATGATCCTC ACCGTCGACG CGGACGGCCT CGATACCACC
GATGTCATCA GTATCACCAG TATTGCCAAC ATCGCTGGCG GCATGGACAG CACCGGAATC
GTGACTGACC GCAACGACTC CGGTCGGTCC CGGGTGCATT CCGGATCCCT CGGCCGTCCG
TCGGCAAGCA CCGTGCGGGA CGGTCGGGGC ACCTCCATCA CCGTGGCCTA CTACGCGACG
AGCCCTTCCG GACGGGAACA TTTCCTCGAT CAGCAGGTGT TGCCCGTGCG GCTGGCCGCA
CGGGACGGTT CCCGGCCGGA CGGTTCCCGG CCGGCTGCCG GCCCGCCGGT CACCGTGCTC
GGATCCCTCG GGGAGGTCGA TGAGGTCGGC GAGGGACGTC GGCCGCCGCT GCGCATCATC
ATCGATGATC GTCGAGGCTG A
 
Protein sequence
MSSRHEMGQT GPVSVEPGTE PQAPFDVIDA QVVVSPGLES GLAHGTPGSV VVAAPRGDAV 
PRGNTGLGGN TGLGGDAEPR NDTPRSRPDG EQPPAGPPAS DAARGRRTPV QRLRVRLASI
SGPARYVMVL AVLSAAVAVV LRHTVFPFLS VNNDEGIYLL HARTLAEGRL FPPAPDPARS
YLPWLGTVVG DHYVLKYTPF VPAIFAIGIV LTGSIDPALA MIVMAAVVVT YLLGVELFGS
RRTAALAATL LALSPLVILQ SAMVLSYLPT LVLLQVTVLG VLRGRRRRRA APVVVAGLAL
GVAAAVRPYD VVLLLAPLAI WLLLTSTGGR CWLGRWLLCG LAAPVALVLL SNAAATGDPL
RLPFTLLEPD DKLGFGVRKL YPGDGRHDFG LLEGLRSVGD HLWLFGGWAC GGVVLAGCAI
VVAARRQLSA PAALLGAGGL LFLGGYVGFW GAWNAAELWG GIRYVGPFYL VPVLIPLVHL
GAEGLVRAGE AVFARGRRLG LIAVAGTGAA ILALTAVVLV GAVRANLTLT GHDRDLSAML
DRLPGDPLVF VAANPPFLGH PTPVTANGPK LEDPVLFAVS RGVDDLIVAA DHADRPAYLL
RLASAYDRSP ASPSTARVER LAVRSGRRVA VTLRVDAAPA GARSARVVLT EGVRRLSIPV
HPRIPTSMIL TVDADGLDTT DVISITSIAN IAGGMDSTGI VTDRNDSGRS RVHSGSLGRP
SASTVRDGRG TSITVAYYAT SPSGREHFLD QQVLPVRLAA RDGSRPDGSR PAAGPPVTVL
GSLGEVDEVG EGRRPPLRII IDDRRG