Gene Franean1_0118 details

Gene Information

Locus tag: Franean1_0118
Symbol:
ID: 5668543
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 140647
End bp: 141792
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 75%
IMG OID: 641239046
Product: UspA domain-containing protein
Protein accession: YP_001504491
Protein GI: 158311983
COG category: [T] Signal transduction mechanisms
COG ID: [COG0589] Universal stress protein UspA and related nucleotide-binding proteins
TIGRFAM ID:
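
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 140647
END_BP = 141792


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under these assumptions, 141792 - 140647 + 1 gives the listed 1146 bp, and 1146 / 3 - 1 gives the listed 381 aa.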


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACG GCGCCCTGAC CGGAACGCAG GCGGACCAGA CCCCGGAACG AACCGGCGCG 
CCGGCGATCG AGCGGCACAG CGGACCTACG GTGGGCACAG CAGCCGACAT AGTGGTCGGG
ATCGACGGCT CCCCCGGCTC CGCGGCGGCG TTGACCTGGG CGGTCGCCGA GGCCAGCCGG
CGCGGCCTGC GGGTGCGCGC CGTCCTCGGG TCCTGCGCCG ACGAGCAGCC CACCGCTGTA
CGCAGGTCCG CCGACGCGAT CGCCGGGCCG CACGACGAGG CCACCTTGGC CTTCGCGGCC
AGCCACCTCC TGCATGAGAC GATCGGCGCC GCCCCCATCC CCGCGGGACT CGAGGTCCTC
GAGGAGGTGG TGGACGCTCC TGGCGCCGAG GCCCTGCTCA CCGCCGGCCG GGACGCCGCC
ATGATTGTCG TCGGAGCACG CGGGCGCGGG CTCCTGCACC GCCTGCGGCT CGGATCGGTG
AGTACGTCCG TGGCCGTCCA TTCCCCCGTG CCAGTGGTCG TCGCGCGACT CCCCCGTTCG
GGGGATGCCG GCGAGCCCGA TGCGGACGGC CTCGCCGGTG CCGGGCCGGT CGCCGACGAG
CGGCTGAGCC CCACGAGCGC ACCGAGGCAG GGCACGCCGC ACCGGCGGCC GGTGGTCGTC
GGGGTCGACG GCTCACCCAA CTCGCTGGCC GCGCTGCGGT GGGCCGCGGT CACGGCGGCA
CTGCGTGGGG CACCGCTGCA TGTCATCCAC AGTTGGCTCG CCGCGGTCCC CCTCCCGTTC
GCCGAGACGT CCGGGGAGAT CGTGCAGGCG CTCGAAGGCC AGGCGCGGGC CGTGCTGGAC
GAGTCCATCG AACAGGTCCT CGGCCCGATC CCCGGCGGCG AGCCAGGGGA GCCCGCCGAG
CCGGGCGGTA CGGAGCCCGC CGTGCTCCGG CTCGCCGCTC CGGCACCCGG CTCCGGGGAG
ATCGACGTCT ACCGTCAGCT GATACCCGCC TCTGCCACCC GGGCCCTCCT CGAGGCCAGC
CACGACGCCG ACCTGCTGGT CGTCGGAGCC CGGGGCAAGG GCGGATTCGC CGAGCTCCTC
CTGGGCTCGG TCAGCCACCA GACGATGCTG CACTCCGCCG CCCCCGTAGC GATCATCCGG
GCCTGA
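
The GC content field can be recomputed from the gene sequence as the percentage of G and C bases. This is a minimal sketch that assumes the sequence is pasted as shown here, in whitespace-separated blocks of ten bases; `gc_percent` is a hypothetical helper name, and how the database rounds its reported value is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string.

    Whitespace (spaces and line breaks between blocks) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a short demo input.
    print(gc_percent("ATGACCGACG GCGCCCTGAC"))
```

Passing the full gene sequence instead of the demo input gives the value to compare against the GC content listed in the record.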
 
Protein sequence
MTDGALTGTQ ADQTPERTGA PAIERHSGPT VGTAADIVVG IDGSPGSAAA LTWAVAEASR 
RGLRVRAVLG SCADEQPTAV RRSADAIAGP HDEATLAFAA SHLLHETIGA APIPAGLEVL
EEVVDAPGAE ALLTAGRDAA MIVVGARGRG LLHRLRLGSV STSVAVHSPV PVVVARLPRS
GDAGEPDADG LAGAGPVADE RLSPTSAPRQ GTPHRRPVVV GVDGSPNSLA ALRWAAVTAA
LRGAPLHVIH SWLAAVPLPF AETSGEIVQA LEGQARAVLD ESIEQVLGPI PGGEPGEPAE
PGGTEPAVLR LAAPAPGSGE IDVYRQLIPA SATRALLEAS HDADLLVVGA RGKGGFAELL
LGSVSHQTML HSAAPVAIIR A
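
The protein sequence follows from the gene sequence by reading it in codons under translation table 11. The sketch below builds the codon table from the usual TCAG ordering, on the assumption that table 11 assigns the same amino acids as the standard code and differs only in which codons may act as start codons; `translate` is an illustrative helper, not a library function.

```python
from itertools import product

# Codons enumerated in TCAG order, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGACCGACG GCGCCCTGAC CGGAACGCAG"))
```

The first three blocks of the gene sequence translate to MTDGALTGTQ, the first block of the protein sequence, and the final codon TGA is the stop codon that ends translation after the terminal alanine.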