Gene Franean1_2239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2239 
Symbol 
ID5670638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2675780 
End bp2677156 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content77% 
IMG OID641241159 
Productglycosyl transferase group 1 
Protein accessionYP_001506580 
Protein GI158314072 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.324068 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00686408 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTTTACGC GGTGGACGGA TGATGGTCCT CGCCTGGGCG TTGGTCGCTG CCACCGTGCT 
CGTGTGCGAG TCGCCGTGGT CAGTGAGTCG TTTCTACCCC AGGTCGACGG CGTGACGAAC
TCGGTGTGCC GGGTGCTGGA GCATCTGCGC GACACCGGTC ACGAGGCGCT CGTCGTCGCC
CCGGCGCCGG CTCCGGCCGC CCGCCGGACC GCCCCGCGCA GCTACGCCGG GGCCCCGGTG
CTGTGGAGCC CGTCGGCGCC GATGCCCGGC TACCCCGAGT TCCGCTTCGC GACGCCGTGG
CCGGGGCTGG CCGCGGCGCT GCGCGAGTTC CGCCCGGACA TCGTCCACCT TGCCGCCCCG
GCCGGCCTGG GCGCCCAGGC CGCGTACGCC GCGCGCCGGC TGGGCGTACC GAGCATCGCC
GTCTACCAGA CCGACATCGC GGCGTTCGCG ACCCGCTACG GGCTCTCCGC GGCCGAGCGG
ACGATCTGGC GCTGGCTGGC CAGCGTGCAC CGCCTCGCCA CCCGCACGCT GGCGCCGTCC
TGGGACGCGG TGGACACCCT GCTCGCCGAG GGCGTGCAGC GGGTCGCCCG CTGGAGCCGG
GGCGTCGACC TCGAGCGCTT CCACCCCGAT CACCGCGACG AGCGGCTGCG GGCCGCCCTG
GCGCCGCGCG GCGAGGTCCT CGTCGGGTAT GTGGGACGAC TCGCCCGGGA GAAGCGCGTC
GAGCTGCTCG CCGGCATCGC CGACCTGCCG GGGGCGCGGC TCGTCGTCGT CGGCGACGGG
CCCTGCCGGC CGGCGCTGAC GAGGGCGCTG CCCGGGGCGG CGTTCCTGGG CTTCCGCACC
GGCGCCGACC TGTCGGCCGC CGTCGCCAGC CTGGACGTCT TCGTCCACAC CGGCACGCAT
GAGACGTTCT GCCAGGCGGC GCAGGAGGCG AAGGCCAGCG GGGTCGCGGT CGTGGGGCCG
GCGGCCGGGG GGCTGCTCGA CGTCATCGAG CACGAGCGGA CCGGCCTGCA CTACACGCCC
GGCGACCCGC ACGCGCTGCG CCGCGAGGTC ACCCGGCTGG TCGAGGACGG CGAGCTGCGG
GCCCGGCTGG CGAGCGCGGC GCGCGCCTCG GTGGCCGGCT GCGACTGGCA CGCCATCGGC
GACGAGCTGC TCGGGCACTA CCGGGACGTG CTGGGCACCG CGCAGCCCGC GGCCGGGCGG
CGCCACCGCC TGCCCACACG GCTCACCCGG GCCCGCCGCG GGCCCGGCGG CATGCGGATC
GCCGGACCGG CGGAGGCCAC GCGCCCCGCG TGGACGGCGA TGACACCGAC GACCGCGGTG
CCGGCGCTCG CCGCGACAAC CGCGACGACC GCGACGACCG ACGGGTGGCC AGGATGA
 
Protein sequence
MFTRWTDDGP RLGVGRCHRA RVRVAVVSES FLPQVDGVTN SVCRVLEHLR DTGHEALVVA 
PAPAPAARRT APRSYAGAPV LWSPSAPMPG YPEFRFATPW PGLAAALREF RPDIVHLAAP
AGLGAQAAYA ARRLGVPSIA VYQTDIAAFA TRYGLSAAER TIWRWLASVH RLATRTLAPS
WDAVDTLLAE GVQRVARWSR GVDLERFHPD HRDERLRAAL APRGEVLVGY VGRLAREKRV
ELLAGIADLP GARLVVVGDG PCRPALTRAL PGAAFLGFRT GADLSAAVAS LDVFVHTGTH
ETFCQAAQEA KASGVAVVGP AAGGLLDVIE HERTGLHYTP GDPHALRREV TRLVEDGELR
ARLASAARAS VAGCDWHAIG DELLGHYRDV LGTAQPAAGR RHRLPTRLTR ARRGPGGMRI
AGPAEATRPA WTAMTPTTAV PALAATTATT ATTDGWPG