Gene Franean1_4333 details


Gene Information

Locus tag: Franean1_4333
Symbol:
ID: 5672688
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5176408
End bp: 5177439
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 75%
IMG OID: 641243206
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_001508623
Protein GI: 158316115
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID:
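
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 5176408
END_BP = 5177439
GENE_LENGTH_BP = 1032
PROTEIN_LENGTH_AA = 343


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1032
print(coding_codons(GENE_LENGTH_BP))   # 343
```

Both results match the Gene Length and Protein Length fields of the record.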


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0375931
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCTCGG CCAGGGTGCC GGGACCGGTG GCGCCACCGG TCGTGATCGG GCTGGCGGCC 
GGGGCCGTCG TGCTGCTGAT CTGGGCGGCC GTCGGCGGTG GTTACTTCTG GCCGCGCTGG
GTGTGGTTCG GCATCGGCAC CGTGTTCTGG GCCGCGATCC TCGTCGGGTG GGTCCGGCGG
ATACCGCCGG GGCGGCGCCG GTGGCTCGCC GCCACCCGGG CGGTGGCTGC CCTGGCCATC
CCGGTCGACG TGGTGGTGTG GGCGCTGTCC GGCGGCGGCT ACTTCTGGCC GGTCTGGACG
ATTCTCGCGC TCACGATCGG GCTCGCCATC CACACCTGGA TCGTCGCGCT GATGCCCGCC
GAGCGGGAGC GTGAGCTCAC CGAGCGGGTC GACGCGCTCA CCCGCACCCG CCGCGGCGCG
CTCGACGGGC AGGCCGCCGA GCTCAAGCGG ATCGAGCGGG ACCTGCACGA CGGCGCCCAG
GCGCGGATCG TCTCGCTGGC GATGAACCTC GGCATGGCCG AGGCCCTGCT GCACAGCGAC
CCGGCGGCGG CGGCGAAGCT GCTCAGCGAC GCCCGGCTGT CCGCGGTCGG CGCGCTCGAC
GACCTGCGGG CCGTCATGCA CAGCATCCAC CCCTCGGTGC TCGCCGACCG GGGGCTCGCC
GGGGGCATCC GCGCGCTCGC GCTCGACCTG TCCCTGCCGG TCCGCGTCGA CGGCGACGTC
CCGTCCGGGC TGCCGGCGGC GGTCGAGTCG GCCGTCTACT TCGCGACCGC GGAATGCCTG
GCCAACGTCG TCAAGCACAG CCGGGCCGCG CACGGCACGG TGCGGTTCGC GCACGACGGC
AGGATGCTGA GCGTGGTCGT CACGGACGAC GGGCTCGGCG GGGCGGATCC CGCGTTCGGC
CAGGGCCTGC GCGGGGTGGT GCGCCGGCTC GAGGCGTTCG ACGGCCGGAT GTCGGTACAC
AGCCCGTCCG GGGGACCGAC GAGGATCACG ATCACCCTGC CGTGCCCGGT CCTGCCGGGC
GCGGAGGCCT GA
 
Protein sequence
MASARVPGPV APPVVIGLAA GAVVLLIWAA VGGGYFWPRW VWFGIGTVFW AAILVGWVRR 
IPPGRRRWLA ATRAVAALAI PVDVVVWALS GGGYFWPVWT ILALTIGLAI HTWIVALMPA
ERERELTERV DALTRTRRGA LDGQAAELKR IERDLHDGAQ ARIVSLAMNL GMAEALLHSD
PAAAAKLLSD ARLSAVGALD DLRAVMHSIH PSVLADRGLA GGIRALALDL SLPVRVDGDV
PSGLPAAVES AVYFATAECL ANVVKHSRAA HGTVRFAHDG RMLSVVVTDD GLGGADPAFG
QGLRGVVRRL EAFDGRMSVH SPSGGPTRIT ITLPCPVLPG AEA
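
The protein sequence can be derived from the gene sequence, and the GC content field can be recomputed from it. The following is a minimal sketch, not the database's own pipeline: it uses the amino-acid assignments of NCBI translation table 11 (which are the same as those of the standard code), strips the 10-base spacing used above, and marks stop codons with `*`. The names `translate` and `gc_content` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Translation table 11
# uses the same amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon; '*' marks a stop, 'X' an unknown codon.

    A trailing partial codon (1 or 2 bases) is ignored.
    """
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First two 10-base blocks and the final line of the gene sequence above.
print(translate("ATGGCCTCGG CCAGGGTGCC"))  # MASARV
print(translate("GCGGAGGCCT GA"))          # AEA*
```

The two fragments reproduce the start (MASARV...) and end (...AEA) of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.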