Gene Franean1_6996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6996 
Symbol 
ID5675307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8523354 
End bp8524562 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content67% 
IMG OID641245842 
Productstress protein 
Protein accessionYP_001511233 
Protein GI158318725 
COG category[T] Signal transduction mechanisms 
COG ID[COG2310] Uncharacterized proteins involved in stress response, homologs of TerZ and putative cAMP-binding protein CABP1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGCTGA CGATGCTCCC CCAGATCGCG CTCCGCCCGC GGCGGATCCA CGTGGTTCTG 
AACTGGGTGA AAACCCCTAC CACACCCCAG CTCGACCTCA TCCTCGTGCT CCCCGCGAAC
AATGCCGGGG CGGCGGGGTC CGCGCGCGCG ATTCAGTTCG CCGATCCTAC GGATCCCACC
GACTCCAGCG CGGCGGTACG GCATCTGAGC AGGACGGACG CGGCGGTGTC CACCGACCGA
GCCCAGGTCG ACCTGAACGC CGTCCCCCCG CGGTACGACC AGATCCTGAT GGCCCTGTCG
GCCACGGGCG GCCGAGTGGT CGACGTGCCC GAAACACGGC TGCGCCTGAT CGACGCGGAA
GACAACACGG AGATACTCCG GATCGATCTC GCCCCGGGCC CGGCGGACCT CGTCCACATA
CCCGGCGAGC TGTACCGGCG TGCGGACGGC TGGTACTACC GGGCGGTCGG GTTCGGATTC
ACGGACGGGC TGACCGGTCT GAGCTTCCGC TTCGCGGTCC CGATCAACGA CATACTGCGG
CGTCAGCAGG TTCCTGCCAC CGATGTGGCA CCGGACGCGG CCGAGAGCAA GGAAGAGAGC
GGGTCGGAGA AGGCCGAACG GCGCGCCTCC TCCGCTCGTG CGGTAACTCC ACTCGGGGAC
GTGAAGCTGC TGCCCGGCGC GTCCGCGACC ATCAAAAGGC CCCAGCACGA GATCACAGCG
GAGCTGACCT GGCGACGCAA GGACAAGGAC CTCGACCTCT ACGCGCTCTA CATCGACAGC
GACGGTCGCG AGGGTGTCTG TTACTACCGT GACCAGGGCT CCCTCAAGCG GCCGCCGCAC
ATCTGCCTGA CGACCGGAGA CCGTCATCGG GGCCGGGAGG CGATCGTGAT CGCCCAGCCG
AGCGCCTTCC GACACATCCT CATCTGCGCG TACTCGGCGG TCGAGAACGG CATCGGGTCG
TTCCGTGGAT TCCGGGCGGT CGTGGAGGTC GACGACCATG CCGGTTCGGT GATACAAACA
CCTCTGTACC ACCGGAACAG CTTCTCCTAC TGGGTTGCGA TAGCGCGCAT CGACCTCACC
GCAGAGGAGG AGGCCGTCAT CGAACACGTG GAGACATACT CGCGTCCCCG CAGCGAGCGA
CGACCGGTCC TCCGTGGTGA CGGAACCTTC GTCATGGACG CAGGGCGAGT GGAGTTCAAG
ACACGTTGA
 
Protein sequence
MSLTMLPQIA LRPRRIHVVL NWVKTPTTPQ LDLILVLPAN NAGAAGSARA IQFADPTDPT 
DSSAAVRHLS RTDAAVSTDR AQVDLNAVPP RYDQILMALS ATGGRVVDVP ETRLRLIDAE
DNTEILRIDL APGPADLVHI PGELYRRADG WYYRAVGFGF TDGLTGLSFR FAVPINDILR
RQQVPATDVA PDAAESKEES GSEKAERRAS SARAVTPLGD VKLLPGASAT IKRPQHEITA
ELTWRRKDKD LDLYALYIDS DGREGVCYYR DQGSLKRPPH ICLTTGDRHR GREAIVIAQP
SAFRHILICA YSAVENGIGS FRGFRAVVEV DDHAGSVIQT PLYHRNSFSY WVAIARIDLT
AEEEAVIEHV ETYSRPRSER RPVLRGDGTF VMDAGRVEFK TR