Gene Francci3_0456 details

Gene Information

Locus tag: Francci3_0456
Symbol:
ID: 3903262
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 533725
End bp: 535032
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 72%
IMG OID: 637877787
Product: glycosyl transferase, group 1
Protein accession: YP_479571
Protein GI: 86739171
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID: [TIGR03449] UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase
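
The start, end, gene length, and protein length fields can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 533725
end_bp = 535032

# Inclusive span of the gene on the replicon.
gene_length = end_bp - start_bp + 1

# One codon per 3 bp; the final stop codon is assumed not to code for a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed 1308 bp and 435 aa.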


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.778621
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.271812
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCTGA TCAGAAGTGG CGCCGCCCAG GTCCGCGGTG AGGCCGAACG GCGCGGGCGG 
CCCAGCCGGG TCGCGATGCT GTCCATGCAC ACCTCACCAA TGGAACAGCC GGGAACGGGC
GATGCCGGGG GGCTCAACGT CTACGTCGTC GAGCTGTCTC GGCAGCTCGC GGCGCTGGGG
GTGGAGGTCG AGGTGTTCAC CCGCGCGGTG AGCAGCAAGC TGCCGACCTC GGCCGAGCTG
TTGCCGGGTG TGACGGTCCG CCACGTTGAC GCCGGCCCGT TCGAGGAGAT CCACCGGGAG
GATCTTCCCG CCTGGCTGTG TGCGTTCACC GCGGCTCTGC TGCGCGCCGA GGCCGGGCAC
GAACCAGGGT GGTTCGATGT GATCCACTCG CACTACTGGC TGTCGGGCCA GGTCGGTCTC
GCGGTGGCAC AACGATGGGG TATCCCGCTC GTGCACACCT CCCATACGCT GGCGAAGATC
AAGAACGGCG CGCTGGCCGT CGGAGACCGC CCGGAGCCGC CTGGCCGGCT ACTCGGCGAA
CAGGAGGTCA TCGGGGGGGC CACCCGGCTG CTCGCCTCCA CGCCGGACGA GTACCGGCAC
CTGATCGATC TGTACGACGC GGCGTCGGAC CGGGTCGACG TCGTCGCGCC CGGCGTCGAC
CTTGAGGTCT TCCGGCCAGG TGACATGGCG CAGTCCCGGG CCCGCGTCGG CGTGGATCCC
GCCGACGACC TGCTGTTGTT CGTCGGTCGG ATCCAACCGC TCAAGGCGCC CGATCTGCTG
CTGCGCGCCG CCGCGGAACT GCTGCGGCGC GATCCCGCCC GCCGCTCGCG GCTCACCGTC
GCCGTGGTCG GCGGCCCCAG CGGATCCGGT CTGGAACAAC CCGACGCCCT GGTCAAGCTC
GCGGCGTATC TCGGGATCTC CGATCGCGTC CGCTTCCAAC CGCCGGCCCC GCAGCAGGAA
CTCGTCCACT GGTACCGCGC GGCCACCGCC GTCGTCGTCC CCAGTCACAG CGAGAGCTTC
GGCCTCGTCG CGCTCGAGGC CCAGGCCTGC GGCACCCCGG TGGTCGCCGC GGCGGTCGGG
GGCCTGCGCA CCGCGGTCGC CGACGGTGTC TCCGGGCTGC TCGTCTCCGG TCGGACCCCC
GCCGTCTATG CCGACGCGCT GGACCGGCTG CTGCGCCAAC CACGATGGCG GGCCCGGCTC
TCCGCCGGAG CGGTGGCCTG GGCCGGTGGG TTCGGCTGGT CGGCCACGGC CCATGGCGTG
CTGCGCAGCT ACCGGCACGC GCTGAGCCCC ACCGCCGTCG CCGTCTGA
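
The GC content field could be recomputed from the gene sequence above. This is a minimal sketch, assuming the sequence is pasted in as a string with the spaces and line breaks used in this record; `gc_percent` is a hypothetical helper name, not part of any database tool.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first 10-base block of the gene sequence only.
print(gc_percent("ATGCGTCTGA"))
```

Running the function on the full 1308 bp sequence and rounding would be the way to compare against the value listed in the record.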
 
Protein sequence
MRLIRSGAAQ VRGEAERRGR PSRVAMLSMH TSPMEQPGTG DAGGLNVYVV ELSRQLAALG 
VEVEVFTRAV SSKLPTSAEL LPGVTVRHVD AGPFEEIHRE DLPAWLCAFT AALLRAEAGH
EPGWFDVIHS HYWLSGQVGL AVAQRWGIPL VHTSHTLAKI KNGALAVGDR PEPPGRLLGE
QEVIGGATRL LASTPDEYRH LIDLYDAASD RVDVVAPGVD LEVFRPGDMA QSRARVGVDP
ADDLLLFVGR IQPLKAPDLL LRAAAELLRR DPARRSRLTV AVVGGPSGSG LEQPDALVKL
AAYLGISDRV RFQPPAPQQE LVHWYRAATA VVVPSHSESF GLVALEAQAC GTPVVAAAVG
GLRTAVADGV SGLLVSGRTP AVYADALDRL LRQPRWRARL SAGAVAWAGG FGWSATAHGV
LRSYRHALSP TAVAV
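
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the standard codon assignments, which translation table 11 shares for internal codons, and it does not handle table 11's alternative start codons (the gene here begins with ATG, so that does not matter for this record). The names `CODON_TABLE` and `translate` are illustrative.

```python
# Build the codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces and line breaks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGCGTCTGA TCAGAAGTGG CGCCGCCCAG"))  # MRLIRSGAAQ
```

The first ten residues match the start of the protein sequence listed above; the same function applied to the final codons `ACCGCCGTCG CCGTCTGA` gives `TAVAV`, the end of the listed protein.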