Gene Franean1_7040 details

Gene Information

Locus tag: Franean1_7040
Symbol:
ID: 5675351
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8590899
End bp: 8591960
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 66%
IMG OID: 641245886
Product: LacI family transcription regulator
Protein accession: YP_001511277
Protein GI: 158318769
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
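
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 8590899
END_BP = 8591960
GENE_LENGTH_BP = 1062
PROTEIN_LENGTH_AA = 353


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 8591960 - 8590899 + 1 gives 1062 bp, and 1062 / 3 - 1 gives 353 aa, matching the listed lengths.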


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTCAGA TAGGGATCGC CGACGTGGCG GCCGCCGCAG GCGTCTCCCG AACGACCGTT 
TCGCTCGTGC TCAACAACGT CGAGGCACGC ATCTCCGACG AGACGAGAGA ACGTGTGCGC
CGGGTCGCGG CGGAGATCGG GTACGTGCCC AATCCGGTCG CACGCAGCCT CCGTACGCGT
CGCACGCGGA CGATCGGCCT GATCTCTGAC CGGATCGCGA CGACACCGTT CGCCGGCCGC
CTGCTCGCCG GCGCGCAGGA TGTGGCACGC GAGCATGAGT ACCTCGTGTT CCTCGTCAAC
ACCGACGGCG ACTCCGAGAT CGAGCGGGAA GCCATCCGTG CGCTCTCAGC CCACCAGGTC
GACAGATTTG TTTACGCCTG CATGTGGCAT CGAGAGATTT CTGTTCCCGC AGGGCTTCCA
GAGTCGACCG TGTTTCTCGA CTGCCGGCCT CTTGCGGACG GCTACTCCGC TGTCGTGCCA
GACGACCATG CCGGCGGTGT CGCCGCGGTC CGAGAGCTGG TCATGGCCGG CCATCGCCGG
ATTGCCTATG TCGACACGAC GGAGAAGGAC CGTCCCGTTG CCGCTGAGCT GAGACACCGC
GGCTATCTCG AGGTGCTGCG CGAGGCCGGC ATCGGCGCCG ACCCACGGCT TCATGTCACG
TTCGAGACCT CGGCACTCGG CGGACGGCGG GCAACCGAGG CCCTTCTCGA CCTGCCTGTC
GACGTTCGGC CCACTGCCGT CTTCTTCTTC AACGACCGGA TGGCCATGGG CGCCTACGTT
GCAGCTCATA CCCGAGGGCT CGAGATCCCT CGGGATCTTT CGGTGTTCGG TTACGACGAC
CAGCAACTGG TGGCGGCCGA ACTCGACCCG CCACTGTCCA CCATCGCCTT ACCGCACTAC
GAGATGGGTC GCTGGGCGAT GGAGATCGTG CTCGGGGTGC GGGAGGCCCC GAACGACACG
TTCCTCATGC CATGCCCGGT CATCCGCCGG GCATCCGTGG GTCCGCCGCC GGCCTCGACT
TCCACATCCA GTCGGCGGCG GACCTTTTTA GCAACCGAGT GA
 
Protein sequence
MTQIGIADVA AAAGVSRTTV SLVLNNVEAR ISDETRERVR RVAAEIGYVP NPVARSLRTR 
RTRTIGLISD RIATTPFAGR LLAGAQDVAR EHEYLVFLVN TDGDSEIERE AIRALSAHQV
DRFVYACMWH REISVPAGLP ESTVFLDCRP LADGYSAVVP DDHAGGVAAV RELVMAGHRR
IAYVDTTEKD RPVAAELRHR GYLEVLREAG IGADPRLHVT FETSALGGRR ATEALLDLPV
DVRPTAVFFF NDRMAMGAYV AAHTRGLEIP RDLSVFGYDD QQLVAAELDP PLSTIALPHY
EMGRWAMEIV LGVREAPNDT FLMPCPVIRR ASVGPPPAST STSSRRRTFL ATE
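
The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a block could be cleaned and its GC fraction computed is given below. The helper names are hypothetical, and only the first and last lines of the gene sequence are used as sample input; the full sequence would need to be pasted in to reproduce the GC content field.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a displayed sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last display lines of the gene sequence in this record.
first_line = "GTGACTCAGA TAGGGATCGC CGACGTGGCG GCCGCCGCAG GCGTCTCCCG AACGACCGTT"
last_line = "TCCACATCCA GTCGGCGGCG GACCTTTTTA GCAACCGAGT GA"

first = clean_sequence(first_line)
last = clean_sequence(last_line)

print(len(first))        # number of bases on the first display line
print(first[:3])         # leading codon of the CDS
print(last[-3:])         # trailing codon of the CDS
print(gc_fraction(first + last))  # GC fraction of the sampled lines only
```

The gene begins with GTG, which translation table 11 accepts as a start codon (read as methionine at the initiation position), and ends with the stop codon TGA, consistent with the protein sequence beginning with M and ending with ATE.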