Gene Franean1_3324 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3324 
Symbol 
ID5671696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3936731 
End bp3938194 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content78% 
IMG OID641242213 
Productglycosyl transferase family protein 
Protein accessionYP_001507633 
Protein GI158315125 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.624929 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACGC CCGTGGCCCC GCTGCCGGTC GGCTTCCGGG TCGTTCTCGA CATGTCGGCA 
CGGCGGCTGA GCGCCGACAG CTGGCTCGGC GGCTCGCCGG CCAGGGTGAT CCGGCTGACC
GCGGCCGGCC AGGCCGCCTG GCAGGAGCTC GCGACCGGCC CGGTGGTGTC CCCGCGGGCA
GGCGCCCTGG CCCGCCGGCT CACCGACGCC GGCCTGGCAC ATCCCAGGCC GCCCACGCCG
CGGCACGACC CGGACATCAC CGTCGTGATC CCGGTCCACG ACCGCGTCGA CAAGCTGGCC
CGGTGCCTCG CCGCAGTGGG CGACCGCCAC CCGGTCGTCC TGGTCGACGA CGGCTCGCGC
GAGCCCGACG CGATCATCGA GCTCGCCGAC CGGTTCGGCG CGAAGGTGAT CAGGCGCCCC
GTCAACGGCG GGCCGGCGGC GGCCCGCAAC ACCGGGCTGG CGGCGACCGC TGGCGAGCTC
GTCGCCTTCG TGGACAGCGA CTGCGTGCCG CCGGCGGGCT GGATCGACGC GCTGGCCGCG
CACTTCGCCG ACCCGCTGGT CGGCGCCGTG GCCCCGCGCA CGGTCCCCGC TCCCGGCACG
CCGGGCGGCT GGGCCGGCCG GTACGCCGGC ACCACACGCA GCCTCGACCT CGGCGGCACG
CCGGCCCGGG TCGGGTCGAA CACCCGGGTG GCCTACGTCC CGACCGCCGC GATCCTGGTC
CGCCGCGCGG CGCTGGCCGA GATCGCCGGC GGCGGTCCGG CGGCCGGCGG GGCGTTCGAC
ACCACGCTGT CGGTCGCGGG CGAGGACGTC GACCTGGTGT GGCGGCTGGA CAAGGCGGGC
TGGCGCATCC GGTACGACCC GACCGTCGAG GTCCGGCACC TGGAACCGGA GACCTGGGCC
GGGCTGCTCG GCCGGAGGTT CCGGTACGGC ACGTCCGCCG CGCCGCTGGC GCTGCGCCAC
CCGGGATCGC TGCCCCCGCT CGTCCTGTTC CCGGGGCCGG CGCTGACGGT CGCCGCGCTG
CTCGCCCGTC GGCCCGTGCT GGCCGCCGCC GCGTACACCT GTTCGGTACT GCGCACCGTG
CGGACGCTGC GCCGGTCAGA CCTGCCCGTC CGGGAGGTGG CGCGCGCGAC GGCAGGTGCC
GTCGGCCGGA CCTGGCTCGG CGTCAGCCGG TACGGCACCC AGTACGCCCT GCCGCTGCTC
GCGGCCGGCG CCGCGGGTGG CGGCCGCCGG CGCTGGGGAC GTCGGGCGGC GGTGGCATCA
CTGGTCGTCG GCCCGGCCCT GGCGGAGTGG GCGGGCCGGC GCGGGTCGAT GGACCCGGTG
CGGTTCGTGC TCGGCCGTCT CGCCGAGGAC GTCGCCTACG GCAGCGGTGT GTGGACCGGG
TGTGTGCACA ACCGGACGAC CATCCCGGTG CGCCCCACGA TTGGCCGGCG CGCCCACGGG
TCGAGAGGAC CCGACCATAG ATGA
 
Protein sequence
MTTPVAPLPV GFRVVLDMSA RRLSADSWLG GSPARVIRLT AAGQAAWQEL ATGPVVSPRA 
GALARRLTDA GLAHPRPPTP RHDPDITVVI PVHDRVDKLA RCLAAVGDRH PVVLVDDGSR
EPDAIIELAD RFGAKVIRRP VNGGPAAARN TGLAATAGEL VAFVDSDCVP PAGWIDALAA
HFADPLVGAV APRTVPAPGT PGGWAGRYAG TTRSLDLGGT PARVGSNTRV AYVPTAAILV
RRAALAEIAG GGPAAGGAFD TTLSVAGEDV DLVWRLDKAG WRIRYDPTVE VRHLEPETWA
GLLGRRFRYG TSAAPLALRH PGSLPPLVLF PGPALTVAAL LARRPVLAAA AYTCSVLRTV
RTLRRSDLPV REVARATAGA VGRTWLGVSR YGTQYALPLL AAGAAGGGRR RWGRRAAVAS
LVVGPALAEW AGRRGSMDPV RFVLGRLAED VAYGSGVWTG CVHNRTTIPV RPTIGRRAHG
SRGPDHR