Gene Franean1_5892 details

Gene Information

Locus tag: Franean1_5892
Symbol:
ID: 5674214
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7154705
End bp: 7156135
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 72%
IMG OID: 641244741
Product: glycosyl transferase family protein
Protein accession: YP_001510143
Protein GI: 158317635
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
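
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The helper names `span_length` and `coding_codons` are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 7154705
end_bp = 7156135
gene_length_bp = 1431
protein_length_aa = 476


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Codons that encode amino acids, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Compare the derived values with the ones listed in the record.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions the derived values agree with the listed Gene Length and Protein Length.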


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.861107
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGACC GGGCGACGAG CAACGGAGCG ATGACCACCA TGGCCATGAA CGTGGCCATG 
ACGGGCCAGG CGGCACCCGA GCCGCTCGCC ACGATCGTCA TCGTGAACTG GAACGGTGCC
CACCTGCTCC CGCCCTGCCT GGACGCCGTC GCCAAGCAGG ACGCGCCGTT CACCTTCGAG
ACGCACGTGG TCGACAACGC CTCGGCCGAC GACTCGCGCG AGGTGCTGGC ACAGCGCTAC
CCGTGGGCGA AGCTCGTCCC CTCGGACCGC AACCTCGGCT TCGCCGGCGG CAACAACCTC
GCCCTGCGAG GTGTCACGAC CCCGTACGCG GTACTGCTGA ACAACGACGC GATCCCCGAG
CCGGACTGGC TGGCCCGGCT GCTGGCCCCG TTCGCCGAGC CGGGCGGCAA CCGGCTCGGC
GCCGTCACCG GCAAGGTCGT GTTCCTCCCC CGCTTCCTGC GGCTGACCCT GTCGACGCCC
ACCTTCTCGC CGGGGCCGCA CGACCCGCGC GAGCTGGGCG TCCGGGTGAG CTCGGTGACG
GTGAACGGCC GCGAGGCGCT GCGCGAGGTG CTGTGGGAGA AGCTCACTTT CGGCGCCGAG
GGCCCGGCGG ACGCGCCGTT CTTCTGGACC CGCGGCGAGG GTGAGCTGTG TGTGCCGGTG
CCCGAGGGCG GCCCGGTCAC CATCGGCCTC ACCTGGGCCG CGGACACCGC CAAGCAGGTC
ACGCTGGGCT GGGCCGGCGC CGGTGACACC ACCGCGACGC GCACGCTGCC GGTGGGCACC
GAGCCGTCCG CCGTGTCGTT CACCGTGGAC GAGGGCGCCC CGCGCGTCGA CGTGATCAAC
AACGTCGGCG GGATCGTGCT CACCGACGGC TACGGCGCCG ACCGCGGCTA CCAGCAGATC
GACACCGGCC AGTTCGACAA CCCCGAGGAG GTCTTCACCG CCTGCGGCAA CGGCATGGCG
ATGCGGACCG AGCTCGGCCA GGCGCTCGGC TGGTTCGACG ACGACTTCTT CCTCTACTAC
GAGGACACCG ACCTCTCCTG GCGCATCCGG GCCCGCGGGT ACCAGATCCG CTACGTCCCG
GGCGCGGTGC TGCGGCACGT CCACTCGGCG TCGAGCGTCG AGTGGTCCCC CCTGTTCGTG
TTCCACACCG ACCGCAACCG GCTGCTGATG CTGACCAAGG ACGCGACCGT GCGCACGGCC
GTCTCGGCGG TCACGCGCTA CCCGCTGACC ACCGCGTCGA TCGCCGTGCG GACCTGGCGA
CAGGCACTGC GCTCGCGCAG CCGCCCGGCG GTGCGGCCCA CCGTGCTGCG GGTTCGGGTC
TTCGCCTCCT ACCTGCGGCT GCTGCCGGCG ATGCTGCGCC GCCGCCGGGA GATCGGCGCG
ACCGCCACCG AACGCCGGGT CAGCCTGCAG AGCTGGCTGG TGGAGCGATG A
 
Protein sequence
MTDRATSNGA MTTMAMNVAM TGQAAPEPLA TIVIVNWNGA HLLPPCLDAV AKQDAPFTFE 
THVVDNASAD DSREVLAQRY PWAKLVPSDR NLGFAGGNNL ALRGVTTPYA VLLNNDAIPE
PDWLARLLAP FAEPGGNRLG AVTGKVVFLP RFLRLTLSTP TFSPGPHDPR ELGVRVSSVT
VNGREALREV LWEKLTFGAE GPADAPFFWT RGEGELCVPV PEGGPVTIGL TWAADTAKQV
TLGWAGAGDT TATRTLPVGT EPSAVSFTVD EGAPRVDVIN NVGGIVLTDG YGADRGYQQI
DTGQFDNPEE VFTACGNGMA MRTELGQALG WFDDDFFLYY EDTDLSWRIR ARGYQIRYVP
GAVLRHVHSA SSVEWSPLFV FHTDRNRLLM LTKDATVRTA VSAVTRYPLT TASIAVRTWR
QALRSRSRPA VRPTVLRVRV FASYLRLLPA MLRRRREIGA TATERRVSLQ SWLVER