Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6996 |
Symbol | |
ID | 5675307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8523354 |
End bp | 8524562 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641245842 |
Product | stress protein |
Protein accession | YP_001511233 |
Protein GI | 158318725 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2310] Uncharacterized proteins involved in stress response, homologs of TerZ and putative cAMP-binding protein CABP1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGCTGA CGATGCTCCC CCAGATCGCG CTCCGCCCGC GGCGGATCCA CGTGGTTCTG AACTGGGTGA AAACCCCTAC CACACCCCAG CTCGACCTCA TCCTCGTGCT CCCCGCGAAC AATGCCGGGG CGGCGGGGTC CGCGCGCGCG ATTCAGTTCG CCGATCCTAC GGATCCCACC GACTCCAGCG CGGCGGTACG GCATCTGAGC AGGACGGACG CGGCGGTGTC CACCGACCGA GCCCAGGTCG ACCTGAACGC CGTCCCCCCG CGGTACGACC AGATCCTGAT GGCCCTGTCG GCCACGGGCG GCCGAGTGGT CGACGTGCCC GAAACACGGC TGCGCCTGAT CGACGCGGAA GACAACACGG AGATACTCCG GATCGATCTC GCCCCGGGCC CGGCGGACCT CGTCCACATA CCCGGCGAGC TGTACCGGCG TGCGGACGGC TGGTACTACC GGGCGGTCGG GTTCGGATTC ACGGACGGGC TGACCGGTCT GAGCTTCCGC TTCGCGGTCC CGATCAACGA CATACTGCGG CGTCAGCAGG TTCCTGCCAC CGATGTGGCA CCGGACGCGG CCGAGAGCAA GGAAGAGAGC GGGTCGGAGA AGGCCGAACG GCGCGCCTCC TCCGCTCGTG CGGTAACTCC ACTCGGGGAC GTGAAGCTGC TGCCCGGCGC GTCCGCGACC ATCAAAAGGC CCCAGCACGA GATCACAGCG GAGCTGACCT GGCGACGCAA GGACAAGGAC CTCGACCTCT ACGCGCTCTA CATCGACAGC GACGGTCGCG AGGGTGTCTG TTACTACCGT GACCAGGGCT CCCTCAAGCG GCCGCCGCAC ATCTGCCTGA CGACCGGAGA CCGTCATCGG GGCCGGGAGG CGATCGTGAT CGCCCAGCCG AGCGCCTTCC GACACATCCT CATCTGCGCG TACTCGGCGG TCGAGAACGG CATCGGGTCG TTCCGTGGAT TCCGGGCGGT CGTGGAGGTC GACGACCATG CCGGTTCGGT GATACAAACA CCTCTGTACC ACCGGAACAG CTTCTCCTAC TGGGTTGCGA TAGCGCGCAT CGACCTCACC GCAGAGGAGG AGGCCGTCAT CGAACACGTG GAGACATACT CGCGTCCCCG CAGCGAGCGA CGACCGGTCC TCCGTGGTGA CGGAACCTTC GTCATGGACG CAGGGCGAGT GGAGTTCAAG ACACGTTGA
|
Protein sequence | MSLTMLPQIA LRPRRIHVVL NWVKTPTTPQ LDLILVLPAN NAGAAGSARA IQFADPTDPT DSSAAVRHLS RTDAAVSTDR AQVDLNAVPP RYDQILMALS ATGGRVVDVP ETRLRLIDAE DNTEILRIDL APGPADLVHI PGELYRRADG WYYRAVGFGF TDGLTGLSFR FAVPINDILR RQQVPATDVA PDAAESKEES GSEKAERRAS SARAVTPLGD VKLLPGASAT IKRPQHEITA ELTWRRKDKD LDLYALYIDS DGREGVCYYR DQGSLKRPPH ICLTTGDRHR GREAIVIAQP SAFRHILICA YSAVENGIGS FRGFRAVVEV DDHAGSVIQT PLYHRNSFSY WVAIARIDLT AEEEAVIEHV ETYSRPRSER RPVLRGDGTF VMDAGRVEFK TR
|
| |