Gene Francci3_3948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3948 
Symbol 
ID3906907 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4726681 
End bp4728096 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content73% 
IMG OID637881275 
Productglycosyl transferase family protein 
Protein accessionYP_483027 
Protein GI86742627 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGGTGG GGTCGAGGTG CGCGCAGGCC CGCCGGGGGT CCATCCGATT CCGCGGGACG 
ACTTTGGCGC GGTGCGGGTG TCACGGCTGC GCGGCGTCAT GGCGTCATGG CGTCAATGGC
GCCGTACCGC AAAGGACGGA GCCGTTGGTT GTGGCTGACC GGCGGAACGC TACGGTAACC
GCCATGACAG GTGTCCGGAC CATCGGTCGA GGTGCCGCCG TGGCGGCGCT GGGAGCCACA
ACCGTTTACG GTCACGTCTT GTATCCGATC TATATCGGAT TGCGCAGCCG TGGTCTGGAG
TCGACGGCAC CGCCGGACCC TGAGATCTGG CCGGGACTGA GCGTGGTCGT CTCCGCCTAT
CGGGAGTCGG CGGTGATCGG GGCGAAACTG GATGAACTCA CCCGCGCGGA CTATCCCGGG
CCGATGGAGA TCATCGTCGT GGCCGACGAT CCGGAGACGG CCGAGGCCTC GCGCCGGCCC
GGCGTGCGGG TGCTGTCGTC CGGGGAACGC CTCGGCAAGG CGCGGGCGGT CAACCGGGGA
GTCGCCGCCG CCACCCACGA GCTCGTGGTG CTCACCGATG CGAACGCTGT GCTCGCGCCC
GGTGCCCTGC GGGCAGCCGC CCGTCATTTC ACCGACGAGA CGGTCGGCGC GGTGGCGGGG
GAGAAGCAGG TTGACGATCC CGACGGCGCC CAGGGCTTCT ACTGGAAGTT CGAGTCCTGG
CTGAAGCGCC GCGAGTCGGC GACCGGGGCG ACCATCGGCG TGGTCGGCGA GATGCTGGCC
TTCCGCCGCC AGGCGTTTCG GCCCCTGCCC GCGGACGTGG CCGTCGACGA TGCCTGGCTG
GCTCTCGACA TCCTCGAAGG GGGGCTGCGG GTCGTCTACG AACCCGAGGC GTACTCGATC
GAGTCGTCGA GCCCGGACTA CTCGGCGGAG TGGGAGCGGC GGACCCGGAT CGTCGCTGGC
AACCTCGACA TGCTCTGGCG GCGCCGGGCG GCGCTGGTGC CCGGCGCGCT GCCGGTCACC
CCGCAACTGT GGGGCCACCG GCTGGTCCGC TCGTCATTCG GCCCGTTGGC GCAGGTCGTC
CTGGTGGGGC TCGCCCTCCC GGCCGCCCGC CGGAGCTGGA TTGCCCGGCT GTTCCTGGCC
GGCAACGCCG TCGGCGCTGT GAGTACCGCG GCGCTGCTGA CCGGGCGCAC GCCGCCCGGT
CCGACCCGCC TGGTCGCGCA GGTCTTCTTC CTGCAGGCCG TCGCGCTCGG CGGGGTGCGG
CGCTTTGTGG CCCGGGACCG GCCCGCCGTC TGGCCCAAGC CGGAGCGGCC GGCCGTGGCC
TCCGCGACGT CACCGGCGCC CCCGGGGTCG GTCCTGCCAC CGGGGCAGAC GACGCCGCCT
GGCCCGCCGA CCGAGTTGGC CCCGGCCGGC AACTGA
 
Protein sequence
MRVGSRCAQA RRGSIRFRGT TLARCGCHGC AASWRHGVNG AVPQRTEPLV VADRRNATVT 
AMTGVRTIGR GAAVAALGAT TVYGHVLYPI YIGLRSRGLE STAPPDPEIW PGLSVVVSAY
RESAVIGAKL DELTRADYPG PMEIIVVADD PETAEASRRP GVRVLSSGER LGKARAVNRG
VAAATHELVV LTDANAVLAP GALRAAARHF TDETVGAVAG EKQVDDPDGA QGFYWKFESW
LKRRESATGA TIGVVGEMLA FRRQAFRPLP ADVAVDDAWL ALDILEGGLR VVYEPEAYSI
ESSSPDYSAE WERRTRIVAG NLDMLWRRRA ALVPGALPVT PQLWGHRLVR SSFGPLAQVV
LVGLALPAAR RSWIARLFLA GNAVGAVSTA ALLTGRTPPG PTRLVAQVFF LQAVALGGVR
RFVARDRPAV WPKPERPAVA SATSPAPPGS VLPPGQTTPP GPPTELAPAG N