Gene Franean1_6556 details

Gene Information

Locus tag: Franean1_6556
Symbol:
ID: 5674871
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7974311
End bp: 7975321
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 63%
IMG OID: 641245405
Product: glycosyl transferase family protein
Protein accession: YP_001510799
Protein GI: 158318291
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
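
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(7974311, 7975321)
print(length)                     # 1011
print(protein_length_aa(length))  # 336
```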


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.998721
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAGTGG AAAGTGGCGC GCAGGCCGCT GCTGCCGAAT ACCAGGTTGC CGCCGAGACA 
TCATCTCGAC CGCGCCGCAC CCCGACGCCA CCGCGAACAG GTTCGGTGCC GCTGGTGACG
ATTGGACTGC CCGTCCTCAA CGGGGAGAAC TTCCTGGAGC GGGCGCTCGA GTCGCTTGTC
ACCCAGGACT ACCCAAACCT GCAGATCATC GTGGCGGACA ATGGCAGCAC GGACCGCACG
GAAGAGATCT GCCGTGCTTT CACCCGTCGA GACGCGCGGA TCGAGTACCA TCGCAGCGCC
GTGAACCGGG GTGCAGCTTG GAACTACAAC CGGCTCGTCC GACTTGCCGC GGGTACGTAC
TTCAAATGGG CAGCCCATGA CGACCTGTGC GCTCCCTCCT TGGTCAGCCG GTGTGTCGCG
GGCCTGGAAG CCGGGCCCCG AGACGCCGTC CTGGCCTATC CGAAAACTGC TCTCATCGGC
CTGGATGACG CGGTTATCGG TGACTTCGAG GACGAGATGG ATCTTCGGGA GGAGCAGCCG
CACGAGCGGC TAAAGCATTT TCTGTCGACC CGGACAGAAT ACCACCCGGT ATTCGGAGTA
ATCCGGACAG AAGTGCTAGG GGGAACCAGT CTCATCGGCA GGTATGTCGG TTCCGATGTA
GTCCTTCTGG CAGAGCTAGC ATTGCGTGGA AAGTTCATCG AGGTGCCGGA GCGCCTGTTC
CTCAGGCGTT TCCATGCGGG CACGTCCATG AATGCTAATC CGGGGGCCAG GGAGCGCGCT
TCGTGGTTCG ACCCGAAGAG GCGGGTGCCG GCGATGCCGA TGACGGAGCG AACCGTGCGG
ATGGCCATCA CCATCTGCTC CTGTGAACTA CTTGGTCACG TCGAGCGTGT TCGCTGCCTG
GCGGCTCTGG GACAACATTG GGCACGCCCC TATGCACGGC ACATGGGGGG CGAAGTACGC
GCGATAGCCG CGGACCGACT CGTACCCCGG CTCGGCTCGC TGCTCACCTA G
 
Protein sequence
MRVESGAQAA AAEYQVAAET SSRPRRTPTP PRTGSVPLVT IGLPVLNGEN FLERALESLV 
TQDYPNLQII VADNGSTDRT EEICRAFTRR DARIEYHRSA VNRGAAWNYN RLVRLAAGTY
FKWAAHDDLC APSLVSRCVA GLEAGPRDAV LAYPKTALIG LDDAVIGDFE DEMDLREEQP
HERLKHFLST RTEYHPVFGV IRTEVLGGTS LIGRYVGSDV VLLAELALRG KFIEVPERLF
LRRFHAGTSM NANPGARERA SWFDPKRRVP AMPMTERTVR MAITICSCEL LGHVERVRCL
AALGQHWARP YARHMGGEVR AIAADRLVPR LGSLLT
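
The protein sequence corresponds to the gene sequence read codon by codon. A minimal sketch of that translation is given here. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard amino
# acid assignments ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the spaces between 10-base groups) is ignored,
    and any trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base groups of the gene sequence above.
print(translate("ATGAGAGTGG AAAGTGGCGC"))  # MRVESG
```

Passing the full gene sequence to `translate` in the same way would allow a comparison against the listed protein sequence once its spaces are removed.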