Gene Franean1_3391 details

Gene Information

Locus tag: Franean1_3391
Symbol:
ID: 5671762
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4016718
End bp: 4018055
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 71%
IMG OID: 641242279
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_001507699
Protein GI: 158315191
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID:
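
The coordinates and lengths in the record above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information record.
START_BP = 4016718
END_BP = 4018055
GENE_LENGTH_BP = 1338
PROTEIN_LENGTH_AA = 445


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 4018055 - 4016718 + 1 gives 1338 bp, and 1338 / 3 - 1 gives 445 aa, matching the listed gene and protein lengths.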


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0809012
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATCA CCAGCCGGTG GCGGCGCCTC CGGGACCATC ACCGCACAAG CACCGTGGTG 
ATCGCGGTTC TGGCCTTCGC GGCCGCCGGT CCCGCCAGCA GGCTCAACCC GCAGCATGTC
GCGCACGACA GTTCGCCGTG GCCGGCGGTG CTGCTCGCCG CCGTCGCCTG CACCGCACTG
CTCTGGCACG AACGCCATCC CCGCGCGACC GCCGTGGTGG CCATCACCTG CACCGTGTTC
CTGGCCGGCC TGGGCTACCT CCTCACCTCC CTGATGATCG CCCCAGCGAT GGCCGCGCTC
TACTGGCTGG CCGCGCACAC CGACCGCAGA ACCACCCTGA GCATCGCCAT CCCCGGCTGC
GCGGCGGTGG TCGCGACGGC GCTGGTCGCC GACCCCGACG GCTACCCCCT GGAACTCAAG
ACCATCGGCC CGACCGCCTG GCTACTGATG GCCGCGTCAC TCGGCGGCGC GAGACGGATC
AAACAGGACT ACCTCGACGC CGTCAAAGCC CGCGCCGAAT ACGCAGAACG CACCCGCGAG
GCCGAAGCCC GCCGCCGGGT CGCCGACGAA CGCACCCGCA TCGCCCGCGA ACTCCACGAC
GTCGTCGCCC ACCACATCAC CCTCGCCCAC GCCCAGGCCG GCACCGCCGC ACACCTCGTC
CGCACCCACC CCGACCAGAC GGAACCGATC CTCACCAACC TCACCGCCAC CACCTCCTCC
GCCCTGCGCG ACCTCAAGGC CACCGTCGGC CTCCTGCGGC AGAGCGACGA CCTGGACGCG
CCGCTGGAGC CGGCCCCCTC GCTCGCCCAG CTCCCCCAGC TCGCCGACAC GTTCGCCGCG
ACCGGGCTCA CGGTCACGAT CACCACCCGT GGCGAACCGT CCCCGCTCTC CCCCGGCATC
GACCTCACGG CCTATCGGAT CGCGCAGGAG GCCCTCACCA ACGTGGCGAA GCACGCCAGG
ACCGACAACG CCTGTGTCGA CATCACCTAC GCCCCGCACA GCGTCACCCT GATGATCATA
AATGGCGGCG GGGAGAACGC GGGCCCGGTC AGCCGCGGCG CGCGTCCCGG CGCCGGGACG
TCGATCCCGG CCTCGGGCAG CGGGTTCGGC CTGATCGGCA TGCGGGAACG CGCACTGTCC
GTCGGCGGAC ATCTCGAGGC CGGTCATCAT CCCGAGGTCG GCTTCCACGT CACCGCCATC
CTGCCCCTGC ACCCCAGAAC TCCCACGAAA ACGGAAGCCG ATGACTATCC GAGTGCTCCT
CGCCGACGAC CAGACCCTCC TACGGGCAAC CTTCCGGATT CTGATCGACT CCTGCAGCGA
CATGGAGGTC GTCGGTGA
 
Protein sequence
MTITSRWRRL RDHHRTSTVV IAVLAFAAAG PASRLNPQHV AHDSSPWPAV LLAAVACTAL 
LWHERHPRAT AVVAITCTVF LAGLGYLLTS LMIAPAMAAL YWLAAHTDRR TTLSIAIPGC
AAVVATALVA DPDGYPLELK TIGPTAWLLM AASLGGARRI KQDYLDAVKA RAEYAERTRE
AEARRRVADE RTRIARELHD VVAHHITLAH AQAGTAAHLV RTHPDQTEPI LTNLTATTSS
ALRDLKATVG LLRQSDDLDA PLEPAPSLAQ LPQLADTFAA TGLTVTITTR GEPSPLSPGI
DLTAYRIAQE ALTNVAKHAR TDNACVDITY APHSVTLMII NGGGENAGPV SRGARPGAGT
SIPASGSGFG LIGMRERALS VGGHLEAGHH PEVGFHVTAI LPLHPRTPTK TEADDYPSAP
RRRPDPPTGN LPDSDRLLQR HGGRR
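
The sequences above are printed in space-separated groups of ten, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first line of the gene sequence, so its result will not necessarily equal the 71% reported for the full gene.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(block: str) -> float:
    """Fraction of G and C bases in a (possibly grouped) nucleotide block."""
    seq = clean_sequence(block)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, copied from the record.
    first_line = (
        "ATGACGATCA CCAGCCGGTG GCGGCGCCTC "
        "CGGGACCATC ACCGCACAAG CACCGTGGTG"
    )
    print(len(clean_sequence(first_line)))
    print(f"{gc_fraction(first_line):.2%}")
```

To reproduce the record's GC content figure, the full 1338 bp gene sequence would be passed to `gc_fraction` instead of a single line.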