Gene Franean1_6655 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6655 
Symbol 
ID5674970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8083047 
End bp8084219 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content71% 
IMG OID641245506 
Productputative glycosyl transferase 
Protein accessionYP_001510898 
Protein GI158318390 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0707] UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.687261 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTGT CGAAGATGGA CGGGATCGAG GGGATCATGG ACGACGTCAC GGCGGAGAGC 
GAGCCCGAGT CTCGGCGCGG GAACGTGCTG CTGCTCTCCG GCTCCCTCGG AATGGGCCAC
GACGTGATGG CCGAGGCCTG CGCGACGTCG CTGGAGCGCC GCGGGTTCCA CACCCGCACG
GCGGACTGCC TGCGCATGAC GGACGGCCGC AACGGCACGC TCGGGCAGAA CGCCTTCCGC
GGGCTGATCG CCGTCCCGGG GGTGTACGAC GCGCTGCACT TCTCGCAGCT ACGCACGGGC
GGGCGGGTGG CGCGGGCCAT CGAGCGGACG TCCAGCCACT TTCTCGTGCC GCGGCTGCGC
GAGGACATCC GGGCCGAGCC GGCCGACCTG GTGATCTCCA TCTTCGCCTC CGCCGCGGCG
GCGGTCAGCC GGCTGAAGTC GGAGTTTCCG GGGATGACCA CGGCCGTGTT CTCCTCGGAC
TGCTGCGTGC ACCGCATCTG GGTGCAGGAC AACACCGACC TGTTCCTGGT GACGTCCCAG
ACCGCGGCCC GCTACGTGCG CCGGTTCGCC CCGCAGGCCC GGATCGCCGT GGTGCCGACC
CCGGTGCGGA CGCCGTTCTA CGACCCGCCG ACCCAGGAGG AGGCCCGGGG CAATCTCGGC
ATCCCGCTGG AGAGCCGGTG CGTGCTGCTG ATGTCCGGAT CGTGGGGCCT CGGCCCGTTG
GTCGAGGCCG CCGAGGCGCT CGCCGCGGCC GGGGTGTGGG TGCTCGCCGT CGCCGGCCGC
AACGAGAAGC TGGCCGCGCG GCTGTCCGCC CTCGCGCAGC GCGACCACCG GGTGATCCCG
TTTGGATTCA CCAACCGGAT TCCCGAGCTC ATGGCCGCAA GTAACCTGGT GGTGACCTCA
TCGGGGGATA CGTGCAGCGA GGCTCGCGTG ATCGGGCGCG ACCTGCTGTT GATGGACGTC
GTCCCCGGCC ACGGACGGGA CAACCTGCAG AAGGAGCTCG ACCGCGGCCA CGCCGAAGTG
ACCAGTACGG ACTCGCTGTC CCTGACCCGG TCGGCACTGG CCTGTCTGGA CCGGGTCAAG
CCGCCCTCGC AGCGGGTCGC CGTCAGCCCA CAGGCCTGGG AGCAGGCATT CGGTGCGGCG
CTGGCCCAGA TCGGGCTGGG CGTCCACTGG TGA
 
Protein sequence
MSVSKMDGIE GIMDDVTAES EPESRRGNVL LLSGSLGMGH DVMAEACATS LERRGFHTRT 
ADCLRMTDGR NGTLGQNAFR GLIAVPGVYD ALHFSQLRTG GRVARAIERT SSHFLVPRLR
EDIRAEPADL VISIFASAAA AVSRLKSEFP GMTTAVFSSD CCVHRIWVQD NTDLFLVTSQ
TAARYVRRFA PQARIAVVPT PVRTPFYDPP TQEEARGNLG IPLESRCVLL MSGSWGLGPL
VEAAEALAAA GVWVLAVAGR NEKLAARLSA LAQRDHRVIP FGFTNRIPEL MAASNLVVTS
SGDTCSEARV IGRDLLLMDV VPGHGRDNLQ KELDRGHAEV TSTDSLSLTR SALACLDRVK
PPSQRVAVSP QAWEQAFGAA LAQIGLGVHW