Gene Franean1_2309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2309 
Symbol 
ID5670707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2757748 
End bp2759346 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content72% 
IMG OID641241228 
Producthypothetical protein 
Protein accessionYP_001506649 
Protein GI158314141 
COG category[S] Function unknown
[T] Signal transduction mechanisms 
COG ID[COG2013] Uncharacterized conserved protein
[COG2310] Uncharacterized proteins involved in stress response, homologs of TerZ and putative cAMP-binding protein CABP1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.739249 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAACGC AGTTCAGTCG AGGGCAGAAG TCCCAACTGT CGGCAATCAC CGCTGGCACC 
GATCTGTACA TCGGGATCCA GATCAACGCG CCGGGCGAAT GGGACGTGTC CTGTTTCGGG
CTGGACGGCG CCGGCCGGCT CTCCGACGAC CGGTACTTCA TCTTCTTCAA CCAGCCGAAC
TCGCCCGAGT CCTCGATCCA GCTCCTCGGC GCCCAGTCCG GTGACACCCA GTCGTTCCGG
GTGACGCTCG ACAAGGTCCC GGACGCGATC CAGAAGCTGT CGTTCTGCGC GGCGCTGGAC
GGCCCGGGCA GCGCCGCGCA GATCACGTCC GGTTACCTGC GCATCGTGGC CGGTGGTACG
GAGGTGCTCC GCTACGCGTT CACCGGCGCC GACTTCTCCG ACGAGCGCGC CGTCATGATC
GGTGACGTCT ACCGCAAGGG CGTCTGGCGC GTCGCGGCGG TCGGCCAGGG GTTCCGGGGT
GGCCTGGCGG AGCTGATCCG CAGCTACGGC GGCGAGGTCG CCGACGAGCC CGAGCCCACG
CCCGCTCCGG CGGCGCCTGG GTTCGGCGCG CCGCCCGCGC CCGCCTTCAG CCAGCAGAAG
CCGCCTGCCG CGCCGGGCTT CGGCGCCCCG CCCGGGCCCC CGCCCCCGCC CCCGCCGCCC
CCGCCGGCAC CGCCGGCACC GGCGCATCCC GGCCAGGGCT ACGGTCAGCC GCAGCCGGCC
TACGGCCAGG CCGGCGGCGC GCAGCAGGGC TACGCCCAGC CGGGCTACGC TCAGCCACAG
CCCCCGCCGG CGATGCCCGG CGCCCAGGGG TACGGCGCGC CCGGCCAGCA GCCACCCGGC
CAGCCGATGC CGGGCCACCC GATGCCCGGC CAGCAGCAGC CCGGTCAGCC GATGCCGGGC
GGGTTCGGGC CCACGGAGGT CCTGCCGTCG CAGGCCCGCC CCGTCCAGCC CGGTGCCATG
AACAGCCTCA ACCCGTACCG CGAGGTGCCG ACCGCGGGGC GCTGGACCCA GCAGAACAGC
AAGCTGGTCA AGGTCACCCT GGGCCCGGAG GCGCTGGCGC TGCGCGGTTC GATGGTCGCC
TACCAGGGCA ACGTCGAGTT CGACTACAAG AGCGGCGGGA TCCGCGGGCT GATCGAGGAG
AAGCTCACCG GCCAGGGCCT CAAGCTCATG ACGTGCAAGG GCAACGGCGA GGTCTTCCTT
GCCCAGGACG CCTCCGACCT GCACATCGTC GAGCTCGGCA ACCAGTCGCT GTGCATCAAC
TCGAAGAACC TGCTGGCGAT GGACGCCACC GTGCGCTCGG AGGTCCGCCG CATCGAGAGC
CCCGGCATCC CCGGCGGCGG CTTCTTCCAC TTCGAGGTCT CCGGGCCCGG GTCGGTCGTC
GTGATGACCA AGGGCACGCC GATGACCCTC AACGTCGCCG GTCCCACGTT CGCCGACATG
AACGCGCTGG TGGCGTGGAC GTCCGGCATG CGGGTGAGCG TGTCCACCCA GGTCCGCATC
TCCCGCCAGA TCTACGCGGG AGCCAGCGGC GAGTCGTTCG CGTTGCAGTT CATGGGGTTC
GCCGGCCACT TCGTCGTCGT CCAGCCGTAT GAGGTCTGA
 
Protein sequence
MATQFSRGQK SQLSAITAGT DLYIGIQINA PGEWDVSCFG LDGAGRLSDD RYFIFFNQPN 
SPESSIQLLG AQSGDTQSFR VTLDKVPDAI QKLSFCAALD GPGSAAQITS GYLRIVAGGT
EVLRYAFTGA DFSDERAVMI GDVYRKGVWR VAAVGQGFRG GLAELIRSYG GEVADEPEPT
PAPAAPGFGA PPAPAFSQQK PPAAPGFGAP PGPPPPPPPP PPAPPAPAHP GQGYGQPQPA
YGQAGGAQQG YAQPGYAQPQ PPPAMPGAQG YGAPGQQPPG QPMPGHPMPG QQQPGQPMPG
GFGPTEVLPS QARPVQPGAM NSLNPYREVP TAGRWTQQNS KLVKVTLGPE ALALRGSMVA
YQGNVEFDYK SGGIRGLIEE KLTGQGLKLM TCKGNGEVFL AQDASDLHIV ELGNQSLCIN
SKNLLAMDAT VRSEVRRIES PGIPGGGFFH FEVSGPGSVV VMTKGTPMTL NVAGPTFADM
NALVAWTSGM RVSVSTQVRI SRQIYAGASG ESFALQFMGF AGHFVVVQPY EV