Gene Franean1_3722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3722 
Symbol 
ID5672087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4406517 
End bp4407698 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content71% 
IMG OID641242603 
Productglycosyl transferase family protein 
Protein accessionYP_001508023 
Protein GI158315515 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID[TIGR01426] glycosyltransferase, MGT family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.671186 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGTGG TACCACCGCT GGCCGGGCAT GTGAACCCGG CCACCGCGGT AGCGGGGGAA 
CTGGCCGCGC GCGGTCACCA GGTGGCGATT GCCGGTCATG CCGACGTCAT CGCGCCGATC
GTACCGGCGA AGGTCGACCT GCTCGCGCTG TCCGGGAGAC CGGCCGACGC CGCGGAGCGG
ACCAGAATAG AGGCCTCGTC CCAGCGGCTT CGTGGCGTAG CCGCGCTGAA GTTCCTCTGG
CAGGACTTTC TCCTCCCCCT CGGCGCGGCG ATGATCCCGG AGATCGACGC CATGGCCAGC
GAGTTCCGTC CCGACGTGGT GGTCGCCGAC CAGCAGGCTG TCGGCGCGTC GGTCGTGGCA
CGCCGACGAG GCACACGGCT AGCCGTCCTC GCCACCACGC CCGCCGAGTT CGACGACCCC
TACGCCGGGC TCGACCGCGT CGGCGCCTGG ATCGCCGGAC TGCTCCAGGA CTTCCAACTC
GCCCACGGCA TCCCCCCGGA ACAGGCCGCG GCCACGGACC CCCGGTTCTC CGACCAGCTC
ACCCTCATCT GCTCCGTGCC AGGCCTACTC AAAGCGGGCC GTTTCGCGGA ATCAGTGGTC
TTCGTCGGCT GCGCGGCCGC TCGGCGTCGT GCCGACCCGG ACTTCCCCTG GACATGGCTG
GACGAAACCC GGGCCACCGT CCTGATCTCA CTCGGCACGG TCACCCGCGA GGCCGGCCGC
CGTTTCCTGC GCGCCGCCGC CGAGGCGATG CTGTCGATGG CCACCGACGT CCAGGCCGTC
GTCGTCGCGC CACCGGGGAC CGCCACCGAC CTGGCCCTGG CGGCGCCCGC CGATCTCCTC
GTCACACCAC GCGTACCACA GCTCGCGTTG CTGCCACACC TGGCCGCGGT GATCTGCCAC
GCCGGCAACA ACACCGTGTG CGAATCGCTC GCACACGGAG TCCCACTCGT CGTCGCACCC
GTCCGCGACG ACCAGCCCAT CATCGCGGAA CAGGTAGAAC GAGCCGGAGC AGGCACCCGG
ATCCGATTCG GCCGTGCCGG CGCGGCAACG ATCGCCGATG CCCTGCGAAA CGTGCTCGAC
GATCCGACCT ACCGAGCGAC CGCCGGGCGA CTGCGACAGC AGTTCACCGC CGCCGGCGGC
ACGGCCACAG CAGCGGCCCA CATCGAGCAA CTCGCCAGCT AG
 
Protein sequence
MFVVPPLAGH VNPATAVAGE LAARGHQVAI AGHADVIAPI VPAKVDLLAL SGRPADAAER 
TRIEASSQRL RGVAALKFLW QDFLLPLGAA MIPEIDAMAS EFRPDVVVAD QQAVGASVVA
RRRGTRLAVL ATTPAEFDDP YAGLDRVGAW IAGLLQDFQL AHGIPPEQAA ATDPRFSDQL
TLICSVPGLL KAGRFAESVV FVGCAAARRR ADPDFPWTWL DETRATVLIS LGTVTREAGR
RFLRAAAEAM LSMATDVQAV VVAPPGTATD LALAAPADLL VTPRVPQLAL LPHLAAVICH
AGNNTVCESL AHGVPLVVAP VRDDQPIIAE QVERAGAGTR IRFGRAGAAT IADALRNVLD
DPTYRATAGR LRQQFTAAGG TATAAAHIEQ LAS