Gene Franean1_3663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3663 
Symbol 
ID5672029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4339543 
End bp4341162 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content72% 
IMG OID641242546 
Productglycosyl transferase family protein 
Protein accessionYP_001507966 
Protein GI158315458 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1928] Dolichyl-phosphate-mannose--protein O-mannosyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.870133 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.832443 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGCAG CAACAACAGC TGATTTCTCC CGCTCCGGAA CCGACCAGGC CGGTTCCGAC 
CGGCGGCGCC CGCCCGGGTC CGCGGCCGAC GCGCTGCGGA TGCGGCTGTG CCCGCCGATG
CCCGGTGACC GGGTGATCGG CTGGGTGGCC GCGCTCGCGG TCACCGCCGT CGCGGGCATC
CTGCGGTTCT GGCAGCTCAC CGAGCCGCGG GGCATGAAAT TCGACGAGGT CTACTACACC
AAGGACGCCT GGGGCCTGAT GACCTCGGGC TACGAGGTGA ACAGCGAGAC CTGCACCGGC
CCCGCCTTCG TGGTGCATCC GCCCCTGGGC AAGTGGTTCA TGGCCGCCTC CGAGAAGATC
TTCGGCTACA CCGACTGCGC CGGCGTCGCG CACGGCAGCC CAGAGCTCGG ATGGCGGTTC
GCCTCGGCGC TGTTCGGCAC GCTGGCGGTG CTGGTGCTCA CCCGCACCGC ACGGCGGATG
TTCCGGTCCA CCGTGCTCGG CTGCTTCGCG GGCCTGCTGC TGACCCTGGA CGGGCTGGAG
TTCGTCCAGA GCCGGATCGG CATCCTCGAC ATCTTCCTGA TGACAGGGCT CGTGCTCGCG
CTGGCCTGCC TGGTACTCGA CCGCGATCAC GGCCGGGCCG CGCTCGCCGC GCGCGTCGCC
GCGGGCCCGC CGTCCGGTGG CGCGCCGTCC AAGGCGACCG AACGCTTCGT CCGCTACGGG
CCGCGCGCCG GGCTGCGGCC CTGGCGGATC GCCGCCGGGC TGTGCCTCGG GGCGTCGATG
GGCGTGAAGT GGAGCGCGCT CTACACGCTG GTCGGTCTCG CCGCGCTGGC CCTGGCCTGG
GATGTCGGTG CCCGGCGCAC CGCGGGCGCG CGGCGGCCGG TGCGCGGGGC GCTGCGCCGC
GACCTGCCGG CCTGGTCCGG CTGTTACATC CTGCTGCCGA TCGTGACGTT CCTGGCCACC
TGGACGGGCT GGTTCGTCAC CGACGGCGGT TACAACCGGC ACAGGTACGG CGACGGGTTC
TTCGCCGCCT GGCACGGCTG GTGGGACTAC CAGCAGGACA TCCTCGACTT CCACGAGCAC
CTGAGCGCGC CGCACGTGGC GCAGTCCACG CCGCTGAGCT GGCTGGTGCT CGCCAGGCCG
GTGGTCTACG CCTACGACAG CCCGAAGCTC GGCGAGCGGG GTTGCCACGC CGCCGCCGGC
TGCTCCCGCG AGGTGCTGGC CCTGGGCAAT CCGGCGGTCT GGTGGGTCGG CACAGCCGCG
CTGGTCGCGA TGCTCGCGCT GTGGGTCAGC CGGCGCGACT GGCGGGCCGC CCTGGTACTC
GTCGGCTTCG GCTCGTCGTT CCTGCCGTGG CTGGCGTTCC CCAACCGGAC GATGTTCTTC
TTCTACGCCC TGCCGTCGCT GCCGTTCCTG ATCCTCGGGA TCACCGCGTC CGCCGGCCTC
GCGCTCGGCC CCCGCGACGC GTCGGACACC CGCCGCATGA TCGGGGCGTT GTCGTTCGGG
CTCTACCTGG CCGCCGTCGT GCTGATGTTC GCCTACTTCT ACCCGATCCT CGCCGCCCAG
ACGATTCCGC TGAGCTCCTG GCGCGACCGC ATGTGGTTCC CCGGCTGGAT CGTCGCCTGA
 
Protein sequence
MTAATTADFS RSGTDQAGSD RRRPPGSAAD ALRMRLCPPM PGDRVIGWVA ALAVTAVAGI 
LRFWQLTEPR GMKFDEVYYT KDAWGLMTSG YEVNSETCTG PAFVVHPPLG KWFMAASEKI
FGYTDCAGVA HGSPELGWRF ASALFGTLAV LVLTRTARRM FRSTVLGCFA GLLLTLDGLE
FVQSRIGILD IFLMTGLVLA LACLVLDRDH GRAALAARVA AGPPSGGAPS KATERFVRYG
PRAGLRPWRI AAGLCLGASM GVKWSALYTL VGLAALALAW DVGARRTAGA RRPVRGALRR
DLPAWSGCYI LLPIVTFLAT WTGWFVTDGG YNRHRYGDGF FAAWHGWWDY QQDILDFHEH
LSAPHVAQST PLSWLVLARP VVYAYDSPKL GERGCHAAAG CSREVLALGN PAVWWVGTAA
LVAMLALWVS RRDWRAALVL VGFGSSFLPW LAFPNRTMFF FYALPSLPFL ILGITASAGL
ALGPRDASDT RRMIGALSFG LYLAAVVLMF AYFYPILAAQ TIPLSSWRDR MWFPGWIVA