Gene Franean1_7195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7195 
Symbol 
ID5675496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8785229 
End bp8787895 
Gene Length2667 bp 
Protein Length888 aa 
Translation table11 
GC content71% 
IMG OID641246032 
Productputative glycosyl transferase 
Protein accessionYP_001511420 
Protein GI158318912 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.80662 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCATACG ACTACGCGAC CTTCTCGACG CTCGCCGGGC CGTTGACGCG CCCGGCCGAA 
GGTGCGGTGC ACCACGTCGA GTACCGGCGC ATCATGGCAG GTCGTGTGCT CTGGCAGGTC
ATGCCGCTGC TCATCGCGAC ACTGGGCCTG TCGTCGTTGT TCTTCGTCTG GCTTCTCCAG
CCCCAGCACT ATCCCAGCAC GAAATACATA CCGACGGCGG TGGCCGTCGG GAACCACATC
ATGTTCTGGC TGATCGTGGT GACCGAGGGC ATCCGCCTGC TCAGCTCCAT CATCCTGTGC
TGGTCATCAG TGATCATGCG CGACCCGGTG CCGGTGCTGC CACCCCCGGG ACTTCGAGTG
GCGTTCACCA CGACGATCGT GCCGTCCAAG GAACCCGTCG ACATCGTCCG CGACACCCTG
ATCGCCGCCC GCAACATTCA GTACAGCGAG CAGATCGACG TCTGGCTGCT TGACGAGGGC
AACGACCCGG CGGTGAAGGC AATGTGCCGG GAGACCGGCG TGCACCACTT CTCCCGCAAG
GGCGTGGAGA AGTGGAACAC GCCGAAGGGA CGGTTCCGCG CCAAGACCAA GCACGGCAAC
CACAACTCGT GGCTTGACGC GAACGGCCAC AAGTACGACG TCGTGCTCTC CGTCGACCCG
GATCACATCC CGCTGCCGAA CTTCGCCGAC CGGATGCTCG GGTACTTCCG CGACCCGAAT
GTGGCCTTCG TGGTCGGCCC GCAGGTCTAC GGGAACTTCC GGCACTATCT GACCCGGGGC
GCCGAGGCCC AGAACTACAT GTTCCACTCG GTGATCCAGC GGGCCGCGAA CCGCTTCGCG
GCCGGCATGT TCGTGGGTAC CAACCACGCC TACCGGGTGT CCACCTGGGA TCAGATCGGC
GGGTTCCAGG ACTCGATCAC CGAGGACCTG GCGACATCCT TCGCCGTGCA CGGCGCGTTC
AACGAGGTCA CCGGGCACCG CTGGACGTCG GTCTACACCC CGGACGTGGT CGCCGTGGGC
GAGGGCCCGG CGAACTGGAC GGACTTCTTC AGCCAGCAGC TGCGGTGGGC GCGGGGCGCC
AACGAGGTGA TGGTCACCGA GGCCCCGCGG CGGCTGAAGG CGCTGAGCTG GGGACCGCGC
CTGCACTACC TGACGCTGAT GGTGCACTAC CCCACCGTCG CGATCACCTG GATCGTCGGC
AACCTGCTCA CCGTGCTCTA CATGGCGCTC GGCTCGACCG GCGTCCTCGT CAACGTCTCG
TTCTGGCTGG CGCTCTACGT CGACGTGTTC GTCGCGCGGA TGCTGCTCTA CTTCTGGCTG
CGGCGGTTCA ACATCAGCCC GCACGAGGAG AAGGGCAGCG CGGGCATGAG CGGGATCTTC
GTGTCCGTGC TGTGCACGCC CTTCTACTCG ACGGCCTTCG TCGGCGCGCT CACCCGCCGC
AAGCTCGGCT TCGTGGTCAC CCCCAAGGGG AACGCGGCCA GCCCGGACCG CCTGATGACC
TTCCGCAAGC ACCTGTTCTG GGCGGCGGTC TCGGGCGGAT CCGTGGCCGG AGCCGCGGTC
TTCGGTCACC TGTACCCGGC GAACATGGTC TGGGCCTCGC TGTCGATCAT CACCTGCCTG
ATTCCCATCG GGCTGTGGCT GATCGAGCCG ATCCTGGCCC CGCGGCGGGC GGTCGCTCCC
AACCCGTTGC CGGCCGCCCA CATCCCGCCG CAGCGCGCCG GCGACCGCGC ACCGGCCGAC
GCCCGCCCCG GCGCCCAGCG CCACGCCGAC CGCATCCCCG CCGGTCGCGG CGGCGTCGAC
CCGGCCACGG CCGAGACGTC CACCCTCGAC CCGAGCGCCG CGGACACGAC CGTCATGGGG
GCGATCGGCG CAGGGAGCGC CAGTGGGAAC AGGGCCGGCG GGAACGGGCC CGGCGGAAGG
GGACCCGGCG GGAAGGGATT CGGTGCCAGG CCGTCCGACG TCGACCCGGC GACCGTCGGC
ACGCCGCTCC CGGAGGCACC CGGCACCGAG CGGACGGTTC CGCCTGTTCC CCCGCCGGCG
CCGCCGCTCA CCCAGCCGCA CCCGGTACGG CCGGCACGCC CGAACCAGCC TCGGCCCGCC
CAGCCGGGAC GTCCCGCCGC ACCCGGGCGG CGGCGTGCCA GGCCGGCAGG CTCCCCGGAG
CACGCCGGCA CCGGGCGGGC CGGGTGGGAG CGCGACGATC CGACGCTGAC CGGCCTCGAG
CCCGTCGGCG CCCGCGGCGG CGCGGAGGGC GGCGAGGATG CCGAGCAGTC GCGGCCGCGG
GACCCGGGCT GGTTCAGCGA CGACACCGTG CGTACGGAGA AGCCGAGGAT CGCGGCCATC
GTCGCCGCCG CCGACGCCAA CAGCCCCGGC ACCGGCACTC CTGAGGACAC GGTCCGGACG
TCCCGGCCGG TCGTCGGCGC CGTCCTGGCC GCCGCCGGGC TGCGCCGTGA CTCGGCTAAC
CCCGACGACG TCGATCAGGA CACGGTCGTG ACTCTCCGTT CCCGGCAGGG GTCGCCGGAG
GAGGTCCTCG CCGCCCGGCG GGCGGCGCTC GCCCGGGAGG GCCGCTCCCC CGCCGGCGTT
CCGCACTACA CCGAGGACGT CACGATGACC CTGCATCCCC GCCGCCGGCG GGTGTCACTC
GACGGCCTGT TGGAAGAGAT CGGCTGA
 
Protein sequence
MAYDYATFST LAGPLTRPAE GAVHHVEYRR IMAGRVLWQV MPLLIATLGL SSLFFVWLLQ 
PQHYPSTKYI PTAVAVGNHI MFWLIVVTEG IRLLSSIILC WSSVIMRDPV PVLPPPGLRV
AFTTTIVPSK EPVDIVRDTL IAARNIQYSE QIDVWLLDEG NDPAVKAMCR ETGVHHFSRK
GVEKWNTPKG RFRAKTKHGN HNSWLDANGH KYDVVLSVDP DHIPLPNFAD RMLGYFRDPN
VAFVVGPQVY GNFRHYLTRG AEAQNYMFHS VIQRAANRFA AGMFVGTNHA YRVSTWDQIG
GFQDSITEDL ATSFAVHGAF NEVTGHRWTS VYTPDVVAVG EGPANWTDFF SQQLRWARGA
NEVMVTEAPR RLKALSWGPR LHYLTLMVHY PTVAITWIVG NLLTVLYMAL GSTGVLVNVS
FWLALYVDVF VARMLLYFWL RRFNISPHEE KGSAGMSGIF VSVLCTPFYS TAFVGALTRR
KLGFVVTPKG NAASPDRLMT FRKHLFWAAV SGGSVAGAAV FGHLYPANMV WASLSIITCL
IPIGLWLIEP ILAPRRAVAP NPLPAAHIPP QRAGDRAPAD ARPGAQRHAD RIPAGRGGVD
PATAETSTLD PSAADTTVMG AIGAGSASGN RAGGNGPGGR GPGGKGFGAR PSDVDPATVG
TPLPEAPGTE RTVPPVPPPA PPLTQPHPVR PARPNQPRPA QPGRPAAPGR RRARPAGSPE
HAGTGRAGWE RDDPTLTGLE PVGARGGAEG GEDAEQSRPR DPGWFSDDTV RTEKPRIAAI
VAAADANSPG TGTPEDTVRT SRPVVGAVLA AAGLRRDSAN PDDVDQDTVV TLRSRQGSPE
EVLAARRAAL AREGRSPAGV PHYTEDVTMT LHPRRRRVSL DGLLEEIG