Gene Franean1_2150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2150 
Symbol 
ID5670550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2579408 
End bp2580721 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content57% 
IMG OID641241071 
Productglycosyl transferase group 1 
Protein accessionYP_001506492 
Protein GI158313984 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCGCGGC ATGTACTCAG CACAAAGTCA GGGGTAGTAA TGGGCTGGAT GGTGTCACAG 
ATCGGCGCGC GAGAGCACTA CGCGACGGCT GTCGGGCTTG AGGTTTACTC GACGCTAGAC
CAACTGTACA CCGATGCATG GTGCCGTTTG CCGCTCAGCT CTATTCCGAA ACTTCCTGGC
CCGATGGTTC GTTACGCCGG ACGTGCAGAT CGGCGGATCC GATCGGTAAA GAGTTATGAA
CTACAGCTGG GCCCAAGTCA CATTCGGAGT GCTGTCGCGC ACCGTTTCTC CCGAAATTAC
GCGCACGAGC TTCGATGCAT AGAAATCGGC AGAAGGTTCG ACACGTTGGT GCGTAAGGAC
CTGCGTAGGC GCCGCTTCGA TCCGAATAAG GACGCATTCT TTGGCTACTT CGGTGGCGCG
TTGGAAAGCC TTCGATATCT GGGTGACCAG GGAGTCCCCA CGATACTCGA CCAGACTGGA
TGTGGACGGT CCTACTACGA AGAGGTTGCA GCAGAAAGGC TCCTTTGGCC GGAATGGGAA
GGGCGCCCTC CGACGATACA CGAGGCGTAT TTCGACCGAG CCCACGATGA ATGGAAAGCC
GCCTCCGCGG TGGTGGTTAA CTCCAACTGG GCCCGAAAAT CTGCGCTGAA AGAAGGGTGC
TCGCCCGACA AGATATTCGT GCTTCCTCTG GCCTACGATG CGCCGAGTCT GAAGGTTGGT
GCGCGGCCAC CACGTCCACA CGGTCGACTC AAGGTAATGT GGCTCGGTCG CGTTATTTTG
TCCAAGGGTA TCCAATATCT ACTGCTGGCA GCACAGCTAC TTCCCGAGGT CGACTTTATC
GTTGCGGGAC AGATCGGGGT AGATGCCAAC GTCCTACGAA AGGCAACGCC GAGTAACGTG
AAGTTCCTCG GCCCGATCCC GCGCTCCCAT GCAGCGGAGT TTCTGACCTC GGGTGACCTG
TTCGTTCTTC CAACTCTATC CGACAGCTTC GCGTTGACTC AGCTGGAAGC CATGTCGGCA
GGCCTTCCCG TTATTACCAC CGATCGTTGT GGAGATGTCG TGACCGATGG TCAGAACGGT
TATATAGTGC CAGTTCGTGA TCCTTATGCT ATTGCGAATG CTGTGGCACG GCTCGATTGC
GACCGAAATA TGCTCAAGGA ATTTTCTCGT CTTGCGGTGA TCCGCGCCAG GCAACTGTCC
CTGACGAAGT ACGTGGAGAA CCTGGAGACT ATTCGGCGAG GCATCTGCCC CGCGCCCGCG
ATGCGCGACG GCAGTCGCGG ATCAGATCCA TTCCAGGCTG CCACCGTCTG GTAG
 
Protein sequence
MPRHVLSTKS GVVMGWMVSQ IGAREHYATA VGLEVYSTLD QLYTDAWCRL PLSSIPKLPG 
PMVRYAGRAD RRIRSVKSYE LQLGPSHIRS AVAHRFSRNY AHELRCIEIG RRFDTLVRKD
LRRRRFDPNK DAFFGYFGGA LESLRYLGDQ GVPTILDQTG CGRSYYEEVA AERLLWPEWE
GRPPTIHEAY FDRAHDEWKA ASAVVVNSNW ARKSALKEGC SPDKIFVLPL AYDAPSLKVG
ARPPRPHGRL KVMWLGRVIL SKGIQYLLLA AQLLPEVDFI VAGQIGVDAN VLRKATPSNV
KFLGPIPRSH AAEFLTSGDL FVLPTLSDSF ALTQLEAMSA GLPVITTDRC GDVVTDGQNG
YIVPVRDPYA IANAVARLDC DRNMLKEFSR LAVIRARQLS LTKYVENLET IRRGICPAPA
MRDGSRGSDP FQAATVW