Gene Franean1_0073 details


Gene Information

Locus tag: Franean1_0073
Symbol:
ID: 5668498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 90092
End bp: 91351
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 70%
IMG OID: 641239001
Product: glycosyl transferase family protein
Protein accession: YP_001504446
Protein GI: 158311938
COG category: [C] Energy production and conversion; [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID:
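
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length):
    """Residues encoded by an unspliced CDS, assuming its final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(90092, 91351)
print(length, protein_length_aa(length))  # prints: 1260 419
```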


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.657142
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGTGTTG CAATCTTCGC GATGGGTACC CGGGGGGATG CCCAACCCGC GGCGATAATC 
GGCGCCGAAC TGGTCCGGCG CGGTCACGAG GTGGTGTTGG GCGTGCCCGG GGACCTTGCT
GGTTTCGGTA TCAGAATGGG ACTCGACACG GCGTCGATCG GCGTCGACGC GCACGAGTTC
ATGGGCTCCG AGGAGGTACG GGCCTGGCTG GCATCAGGTG ACCTGAGGAA AATCATGAAC
GGGTTCGGTC GGTACAAACG CCAGCGGGCG GAACGTATCG CCGACGCCAT GGCGGACATC
TCCACCGACG CAGATCTCAT CGTGTCCGGG GTGACCATCG AGGACGAGGC GGCCTGTATC
GCGGAGTGGC GGGGGGTGCC GATGGCATGT CTGCACCACG CGCCGATGCG GGCCAACGGA
GAGTTCCCCT TCTTCATCGC CAGCACCCGC CGGCTGCCGC GGGTCGTCAA CCGCCTGATG
TATCCGGCTG TCGAGTTCGC CGGGTGGCGG GCCCTCGCCG CCGACGTCAA CCGGCTGCGT
GCGAGGCTGG GCCTGCGGCC GGCCCGGGAA CCCACCCCAC GCCGGCTGGC GCGGGCCGGC
TCGACGGAAA TCCAGGCCTA CAGCCGGTTC CTGGTGCCAG AACTCGCTGA CTGGGGTCAG
CGTCGCCCGC TGGTGGGTTT CCTCACTCTG TCGCCCGAGC AGCGCCGGCT GCTCGGGGAG
CACCAGCTCG ACCCCGCCGT CGACCAGTGG CTGGACGAGG GCGAGCCACC CGCATACTTC
GGATTCGGGA GCATGCCGGT CCTGGATCCG CCCCGGATCC TCGAGTTGCT TAGCACGGTC
GCCGACAGAC TGGGGCTGCG CGCGCTGGTG AGCGGGGCGT GGGCCACGAC CGGCGTCAGC
GCCGACCGGC GGGTGTGCGT CGTCGGAGAC CTCGACCACG ACACGGTGCT CCCGCGTTGC
CGCATCGCCG TGCACCACGG CGGCGCCGGC ACCACAGCGG CCTCCGTCGC AGCCGGACTG
CCGACCGTCG TGTGCTCGGT CATCGGCGAC CAGCCCTTCT GGGGCGCCCG GCTCGAACGC
CTCGGTATCG GCGCATCCCT TCGCTTTTCC GAGATGAGCG AGCGGGCCCT CGTCGCTGCC
GCGGTCCCCC TGCTGGCCCA CGAACCACGG GAACGTGCAG CGCGGCTGGC CAGCCGGCTG
AAGACAGAGA ACGCGGCATG CCGTACCGCC GACGTTCTCG AGGAGATCCA CAAGTCCTGA
 
Protein sequence
MRVAIFAMGT RGDAQPAAII GAELVRRGHE VVLGVPGDLA GFGIRMGLDT ASIGVDAHEF 
MGSEEVRAWL ASGDLRKIMN GFGRYKRQRA ERIADAMADI STDADLIVSG VTIEDEAACI
AEWRGVPMAC LHHAPMRANG EFPFFIASTR RLPRVVNRLM YPAVEFAGWR ALAADVNRLR
ARLGLRPARE PTPRRLARAG STEIQAYSRF LVPELADWGQ RRPLVGFLTL SPEQRRLLGE
HQLDPAVDQW LDEGEPPAYF GFGSMPVLDP PRILELLSTV ADRLGLRALV SGAWATTGVS
ADRRVCVVGD LDHDTVLPRC RIAVHHGGAG TTAASVAAGL PTVVCSVIGD QPFWGARLER
LGIGASLRFS EMSERALVAA AVPLLAHEPR ERAARLASRL KTENAACRTA DVLEEIHKS
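
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced this record. It assumes the input is a complete CDS in frame, that the first codon is a valid initiator (under translation table 11, alternative start codons such as GTG are read as methionine), and that translation ends at the first stop codon. The helper names `clean`, `gc_percent`, and `translate_cds` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same codon-to-residue mapping and differs in which start codons it allows.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq):
    """Translate a CDS, reading the first codon as M (assumed to be a valid start)."""
    seq = clean(seq)
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    residues = ["M"]  # the initiator codon is translated as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGCGTGTTG CAATCTTCGC GATGGGTACC"))  # prints: MRVAIFAMGT
```

Passing the full gene sequence to `translate_cds` and `gc_percent` allows the result to be compared with the protein sequence and the GC content reported in this record.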