Gene Franean1_2155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2155 
Symbol 
ID5670555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2583310 
End bp2584593 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content63% 
IMG OID641241076 
Productglycosyl transferase group 1 
Protein accessionYP_001506497 
Protein GI158313989 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.824181 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGATCC TGTTCGTCGC AGACTGGCGC AGCATCATCG CCCGGCAGTG GGTGCTAGGC 
ATCTGCTCAC TCGGACATGA ATGCCATGTC GTGTCTTCCT TCCCGACCGA AGCGGGTGAC
GATCCGGTAT GTATCGACGA GATTCCGATC GCGCTGGCAT CGCTAGCCGG CGCTATCTCT
CGCAGACACA GGACGCTTCC CTCAAGCGGA CCGGGCACCA ACAGTGATCC GGGCGCAGAC
GGCCAACCCG TGCCACCCAC GCGCGACCTG CGCCGGTCCT CCAGCCGCCG GAGCGGCCGG
CCTTCGTCGC TCGCCATAGA GCAAATGCGG ACTTCGGTGC ATCGGCATGT GGCACCGCGC
GATCTAGTAC GTCACCTTCC AGCTGCCCAA AAGGTCGTCC ATGCCTTTCA GCCGCAGGTT
GTACATGCCC TGCGTATTCC TTTTGAAGCG ATGTTCGGTA CGCTGCTGGC AGGCGGGCAT
CGGACCGTCG TCTCGATCTG GGGAAATGAC CTGACCCTGC ACGCGCCGAC GACCAAACTG
ATGCGGCGGC ACACGTGGCG GACCCTCACC AAGGCCGACG GACTCATCTC GGACTGCGTA
CGGGACATCA ACCTGTCCCG GTCCTGGGGA TACCCGGCCG GGCGGCCGAC CGTTGTTCTC
CCCGGCAACG GCGGCATCTC GGTACCCACT AACGTCAGAG ATCTTGGTAC ACGCACCCGG
AGCGAGATGG GGTTGGCACC GGAGACACCA GTGGTGTTCA GCCCACGAGG TCCACGCGTG
TATCTACGCC TTGCCAACTT CATACACGCG CTTCCTAGCA TCCTCCGGGC GGACCCGAGG
GTGGTCTTCG TCTTCGCGGA CTGCACGCAT AACAGGCTTC GCGATCTGGC GAGGGACCTC
GGGGTGGAGA AGAGCTGCCG GTTCCTTTCC CACCAGTCCG CAGACCGGAT GCTCGAGCTT
TTCGGCGCCG CAGATGTGTT TGTCTCCCCG AGCGTCCACG ACGGAACACC GAACACCCTG
ATCGAAGGGA TGTCGGCTGC TTGCTTCCCA GTAGCTGGCG ACACGGCATC CATCCGCGAG
TGGATTGTAC CTGGAGAGAA CGGCCTGCTC TGCGATCCGG AAAGTTCGGC AGACATCGCG
GCGAAGGTCG TTTCCGCGCT GGCGGACCGC GCCCTCCGTA AACGGGCGAC CGAACAGAAC
AGAACCCTTG TAACCATGCG CGCCGGTCGG GCCACCACCC TGCAGAGCGC CGATGAATTT
TATCACCAGA TTGCCGGGCT CTGA
 
Protein sequence
MRILFVADWR SIIARQWVLG ICSLGHECHV VSSFPTEAGD DPVCIDEIPI ALASLAGAIS 
RRHRTLPSSG PGTNSDPGAD GQPVPPTRDL RRSSSRRSGR PSSLAIEQMR TSVHRHVAPR
DLVRHLPAAQ KVVHAFQPQV VHALRIPFEA MFGTLLAGGH RTVVSIWGND LTLHAPTTKL
MRRHTWRTLT KADGLISDCV RDINLSRSWG YPAGRPTVVL PGNGGISVPT NVRDLGTRTR
SEMGLAPETP VVFSPRGPRV YLRLANFIHA LPSILRADPR VVFVFADCTH NRLRDLARDL
GVEKSCRFLS HQSADRMLEL FGAADVFVSP SVHDGTPNTL IEGMSAACFP VAGDTASIRE
WIVPGENGLL CDPESSADIA AKVVSALADR ALRKRATEQN RTLVTMRAGR ATTLQSADEF
YHQIAGL