Gene Francci3_1574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1574 
Symbol 
ID3904806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1888566 
End bp1889642 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content65% 
IMG OID637878911 
Productglycosyl transferase, group 1 
Protein accessionYP_480679 
Protein GI86740279 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.178155 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGAAT GTAATCTTGC GCTTCACTAC AGCGGCGGCC TTGACGCGAA CAAGTGGCGC 
GAGCGTCACG GTCGCGGCGA GGTTCCGGAC GCTCTTCCGT ATGGGTTCGA TCGGTTGGCG
CGCTATTCAG TGAAAGTCTC TCCGGTTAGC TTGCGGGTAT CCCGGGACGC GTTCTTCGTT
CGTGGTCCGC GTATTTTCGG CGGCTATGAG TGGGTGGAGA CCGCGGCCTC CCGCCGAACG
GTCGGTGGCT CGGACGTCGT CTTGTGCTGG GACGAGCGGG TCGGGGTCCC GGCGGCGCTG
ATGGCGGGAA CGCGAAACCA TCCCCCCGTC GTGACCGGCG TGATCTGGCT GGCCGATGTC
GCCGGTCGAC GGCCCGGCTC GCTGAAGATC GCCCGTTGTG CCCTGCGAAG GTCCAGCCGG
ATATTTACGC TTTCCACCGC CCAGATTCCG CCGCTCCGAA CGCAGCTGGG CATTCCTGAA
GGACGTTTCT CGCACGTCCT GTTCGGGGTC GACTCCACAT TTTTCCGGAC CTCTGTCAAC
CCTCCGAAGA AAGGGCTCGT GGTGAGTGTC GGAAATGATC GGCACCGCGA CTTCGAAACG
TTGATCCGCG GCCTGTCCCA GGGGTGGAGC AGGCTGGCGG AGCTGAGGCT GGCGGAGCTG
AGACTCGAGC TGGTGACAGC CCGAAACGTG GCCGTGCCAC CGTGGCTCGG CCGGCGTCGG
AACCGTCTTA CTCTTCCCGA GGTGAGCGAT CTGTACAGGG CCGCGTCGAT GGTGGTCGTC
TGCCTCCGGC CCAATCTTCA CGTGAGCGGC GTAACCGTCG TGCTGGAGGC GATGGCCAGT
GGTCGTCCCG TCGTGGTGAC CGATACCCCC GGCATCCGGG ACTACGTCGA TCACGGCCGT
ACCGGGATTC TCGTTCCGCC CTACGACACC GATGCGCTCG CTGAAGCGGT CGTGCATCTG
GTGCTCGACC CGGACCGGGC GGCCGCGCTC GGTGCGGCCG CCCGCCTGGA CGTCGAGCAT
CGCCTGAACA CGGAAGCTCA GGCCCGCCGG TTCGCGACTT TGTTGAGAGA GCTCTAG
 
Protein sequence
MVECNLALHY SGGLDANKWR ERHGRGEVPD ALPYGFDRLA RYSVKVSPVS LRVSRDAFFV 
RGPRIFGGYE WVETAASRRT VGGSDVVLCW DERVGVPAAL MAGTRNHPPV VTGVIWLADV
AGRRPGSLKI ARCALRRSSR IFTLSTAQIP PLRTQLGIPE GRFSHVLFGV DSTFFRTSVN
PPKKGLVVSV GNDRHRDFET LIRGLSQGWS RLAELRLAEL RLELVTARNV AVPPWLGRRR
NRLTLPEVSD LYRAASMVVV CLRPNLHVSG VTVVLEAMAS GRPVVVTDTP GIRDYVDHGR
TGILVPPYDT DALAEAVVHL VLDPDRAAAL GAAARLDVEH RLNTEAQARR FATLLREL