Gene Francci3_1310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1310 
Symbol 
ID3904359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1569297 
End bp1570487 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content72% 
IMG OID637878643 
Productglycosyl transferase, group 1 
Protein accessionYP_480416 
Protein GI86740016 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.039712 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGTCG TCGCCACCGA CGCGTGGTTC CATCACCGGC CGAGTGGGTG GCGGCGCCGG 
GGCCCGATAA CGGTGCTGAA GGTGCTGAAC CGGATGGAGC GCTCCGGCAT TCATCCGGGG
GCCGTGAACC TGCTCCGCCG CCTCGACCAG GACGAGTTCC GGCTGTTGTT CGCGGTGACG
TCCGGCGCGG CAGGAGCGTT CGACGGTGAG ATCCGGGCGC TCGGCGGCGA GGTGTACCAC
TGCCGGGCCG ACTGGCGCTT TCCATGGTCC TTCCTGAGGC TGCTGCGCCA GGTACGCCCG
GACGTGGTGC AGGCGGAGGT GGTGCAGGCG GACGTGACGA TCCTCTCCGG GGTGGTCCTC
GCGCTGGCCC GGCTCGGGGG GGTTCGGCGC CGCGTCGCCT ACCTCGCCGA TGCCCCGGAC
CGGCACGGCG ACAGCCTGGG CGGCCGGGTG CGGCGGATCG TCGGCCGGTT GCTGCTCGAC
CGGTTCGCCA CCCATCTGGT CGCGGTGAGC GAGGCGGTGA TGCGGGGTCT GTGGCGGGAG
AACTGGCGGC TCGACTCCCG TTGCCGCGTC ATCTACCACG GCGTCGAGCT GGAACCGGTC
GGCGTCGCCA TCGCGGCCCG CCGCCGCGCG GAGGAACTCG CCGAGGACGA TCAGGAGCTT
GTCACCATCG TCCACGTTGC CTGCCCGGAT TCGGCGAAGA GCCGGGACCG GGCCGTGGAG
ATTCTCGCGG CGCTGCGGGG GCGGTCTGTG AACGCCCGGC TCCTCTTCGT GGGTCGTCAG
GATGCCGCCG AGACGGCCCG GCTGGTCGCG CTGGCGTCTC GACGTGGCGT CGCTGACCAC
GTCGAGTTCA TCGGCGAGGT TCTTGAGATC CCACGCCTGC TGGTCGCGGC CTCGCTGCTG
CTGGTCACCT CCCGCCACGA GGGCCTGACC GGCATCGTGC TCGAGGCCTG CGCGGTCGGG
ACGCCCGTGC TCTGCGCGGA CCTGCCCGGG GTCGACGAGA TCGCCCGGCT GCTCCCCGGC
GTGACGATCC TGCCGCTGCG TATCTCCGAC GCGGTCTGGG CCGACACCGC CGAGATGCTC
ACCGCCGTTC CCCCGACCAT TGATCAGCGC CGGGAGGCGA TGCGTCTGCT GCGCCGGTCC
CCGTTCACGA TGGAGCACTG GCAACGCGAC ATCACGGCGG TGTGGTCGTA G
 
Protein sequence
MRVVATDAWF HHRPSGWRRR GPITVLKVLN RMERSGIHPG AVNLLRRLDQ DEFRLLFAVT 
SGAAGAFDGE IRALGGEVYH CRADWRFPWS FLRLLRQVRP DVVQAEVVQA DVTILSGVVL
ALARLGGVRR RVAYLADAPD RHGDSLGGRV RRIVGRLLLD RFATHLVAVS EAVMRGLWRE
NWRLDSRCRV IYHGVELEPV GVAIAARRRA EELAEDDQEL VTIVHVACPD SAKSRDRAVE
ILAALRGRSV NARLLFVGRQ DAAETARLVA LASRRGVADH VEFIGEVLEI PRLLVAASLL
LVTSRHEGLT GIVLEACAVG TPVLCADLPG VDEIARLLPG VTILPLRISD AVWADTAEML
TAVPPTIDQR REAMRLLRRS PFTMEHWQRD ITAVWS