Gene Franean1_0743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0743 
Symbol 
ID5669159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp866747 
End bp868351 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content72% 
IMG OID641239670 
Productglycosyl transferase family protein 
Protein accessionYP_001505107 
Protein GI158312599 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1928] Dolichyl-phosphate-mannose--protein O-mannosyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000125548 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACGGCGA CGACGGCGCA CCTACCCCGT TCCGAGACCG GCGCATCGGA GCCCAGCCGC 
CCGGAACCGG CCACTGCCGC CAGAACCGGC CGTACGCCGC GCGAGCGGCT GTGCCCGCCG
ATGCCCGGCG ACCGCGCCGT CGGATGGCTG GCGCCCCTGT TCGTCGCCGT GACCGCCGGG
GTCCTGCGGT TCTGGCGGAT CACCGAGCCC CGCGGGCTGT ACTTCGACGA GGTCTACTAC
GTCAAGGACG CCTGGGGCCT CATGACCGCC GGGTACGAGA TCAACTCGAA GACCTGTGAC
GGGCCCGCCT ACGTCGTCCA CCCGCCGTTC GGCAAGTGGC TCATGGCCGC GTCGCAGTGG
CTGTTCGGCT ACGTCGACTG CGCGGGCACC CCGCACGGCA GCCCCGAGCT GGGCTGGCGC
TTCTCGTCCG CGCTCGCGGG CACGCTGGCG GTGCTCGTGC TCGCCCGCGC GGCCCGCCGG
ATGTTCCGTT CGACAGTGCT CGGCTGCTTC GCCGGCCTCC TGCTCTCCAT GGACGGGCTG
GAGTTCGTGC AGAGCCGGAT CGGCATCCTC GACATCTTCC TGATGACGGG CATCGTCATC
GCCCTGGCCT GCCTGGTGCA CGACCGTGAC GACGGCCGGC GCCGGCTCGC CGACCGCCTC
GACCAGGCCG CGGCCGGCGC CGGCCCGACG CCGGCCGACA CGCGGTTCGG CCCCCGGCTC
GGCCTGCGCC CGTGGCGGCT CGCGATGGGC CTGGCCCTCG GCGCGTCCAT GGGGGTCAAG
TGGAGCGCGC TGTACACGAT CGTCGGCTTC GCCGCGCTCG CGCTGGCCTG GGACGTCGGT
GCCCGCCGCA CTGCCGGCGC CCACTCCCCC GTGCTGGGCG CACTGCGCCG GGACCTGCCC
GGATGGCTGA CGGGCTGCGT CGCGGTGCCC GTCGTGACCT TCCTGGCGAC CTGGACGGGC
TGGTTCGTCA CCGACGGCGG CTGGTACCGC GACCGGTACG GGCACGGTTT CCTCGCCGCC
TGGCACGGCT GGTGGGACTA CCAGATGGAG GTGCTGCACT TCCACGAGGG CCTGTCCGAC
TCGCACCCGT TCCGGTCGAC CCCGATGAGC TGGCTGGTGC TCGGCCGGCC GATCGCCTAC
TTCTACAGCT CCCCGGCGTA TGGCGCCGAG GGCTGCACGG CGGTCAACGG CTGCTCGCGG
GAGGTCATCG CGCTGGGCAA CCCCGCCGTG TGGTGGGGCG GGACGGCGGC GCTGGTCGGC
TCGCTGGCGC TGTGGGTGCG CGCCCGTGAC TGGCGGGCGG CGCTGGTGCT CGTAGGCTTC
GGCTCGGCGT TCCTGCCCTG GCTGCTGTTC CCCAGCCGGA CGATGTTCTT CTTCTACGCG
CTGCCCTCGC TGCCGTTCCT GGTGCTGGGG CTGACGGCGA TGGCAGGGCT GGCCCTCGGG
CCGCGCGACG CGTCCGAGAC CAGGCGGCTC GCCGGCGCGC TGTCCGTCGG CGTGTACACG
GTGATCGTGG TCCTGTTGTT CGCCTACTTC TACCCGATCC TGGCCGCCGA GGTGATTCCT
TATTCATCTT GGCGAATACG TATGTGGTTT CCCGGGTGGA TCTGA
 
Protein sequence
MTATTAHLPR SETGASEPSR PEPATAARTG RTPRERLCPP MPGDRAVGWL APLFVAVTAG 
VLRFWRITEP RGLYFDEVYY VKDAWGLMTA GYEINSKTCD GPAYVVHPPF GKWLMAASQW
LFGYVDCAGT PHGSPELGWR FSSALAGTLA VLVLARAARR MFRSTVLGCF AGLLLSMDGL
EFVQSRIGIL DIFLMTGIVI ALACLVHDRD DGRRRLADRL DQAAAGAGPT PADTRFGPRL
GLRPWRLAMG LALGASMGVK WSALYTIVGF AALALAWDVG ARRTAGAHSP VLGALRRDLP
GWLTGCVAVP VVTFLATWTG WFVTDGGWYR DRYGHGFLAA WHGWWDYQME VLHFHEGLSD
SHPFRSTPMS WLVLGRPIAY FYSSPAYGAE GCTAVNGCSR EVIALGNPAV WWGGTAALVG
SLALWVRARD WRAALVLVGF GSAFLPWLLF PSRTMFFFYA LPSLPFLVLG LTAMAGLALG
PRDASETRRL AGALSVGVYT VIVVLLFAYF YPILAAEVIP YSSWRIRMWF PGWI