Gene Franean1_6438 details


Gene Information

Locus tag: Franean1_6438
Symbol:
ID: 5674753
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7826906
End bp: 7828912
Gene Length: 2007 bp
Protein Length: 668 aa
Translation table: 11
GC content: 73%
IMG OID: 641245286
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_001510681
Protein GI: 158318173
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
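
The coordinates and lengths listed above are mutually consistent, which a short sketch can check. It assumes inclusive 1-based coordinates (the convention under which End bp - Start bp + 1 equals the listed gene length) and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(7826906, 7828912)
print(length_bp, protein_length(length_bp))  # 2007 668
```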


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.160303
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0428099
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCGGGCT ATGCCGACGT GCCGCCCGGC AGTCTGCTGG ACGCCGCGCC GGACGCGATC 
GTCGGCGTGC GCCCGGACGG GCGGATCGCG CTGGTGAACG CGCAGGCCGA GCGGCTCTTC
GGGTACGACC GGGACGAGCT CGTCGGGCAA CCGATGGAGA TCCTCGTCCC GGAGGCGTTG
CGCGGGGCGC ACCCGGCCCG CCGCGGCGCG TACCTCTCCG ACCCGCGGCC GCGGCCGATG
GGCGCGGGCG TGGAGCTCGC CGCGCGCCGC CGGGACGGCA CGGAGTTCCC CGCCGAGATC
TCGCTGTCGG CCGTCCAGAC CGTCGAGGGC ATCTTCGTCA CGGCCGCGAT CCGGGACGTG
ACGAGCCGCA AGCGGGCCGA GGCCATGTTC CGCGGCCTGC TGGAGGCGGC GCCGGACGCG
ATCGTCGGCG TGCGCCCGGA CGGGCGGATC GCGCTGGTGA ACGCGCAGGC CGAGCGGCTC
TTCGGGTACG ACCGGGACGA GCTCGTCGGG CAACCGATGG AGATCCTCGT CCCCGAGTCG
GCGCGGCACC TGCACCCGCG GCACCGCACC CGCTACTTCG ACGATCCGCG GCCGCGGCCG
ATGGGCGCGG GGATGCAGCT CGCCGCGCGC CGCCGGGACG GCACGGAGTT CCCCGCCGAG
ATCTCCCTGT CCGCGCTGGA GACCGAGGAC GGGCTGCTCG TCTCGGCCGC GATCCGGGAC
GTCACCGACC GGCTGGAGGC GCAGGCGGAG CGCGAGCGGC TGCGGGCCCA AGCGGAGCGG
GAACGGCTCG AGGTACAGCT GCACCAGTCG CAGCGGCTGG AGAGCCTCGG CCAGCTCGCC
GGCGGCGTGG CACACGACTT CAACAATCTG CTCGCAGTGA TCATCAACTA CACGGCCTTC
GTCGGTGAGG TCGTCGGAAC CGCCGCCAGG GACCAGGGCG GCCGGTGGGA GTCCGCCCGC
CGCGACATCG AGCAGGTCCA GCGGGCGGCC GACCGGGCCG CCCAGCTCAC CCACCAGCTG
CTCGCCTTCG GCCGGCGCGA GGTGGTCCAG CCGCGCCCGC TGGACCTCAA CGGGGTCGTC
CACGACATGG AGCAGTTGCT GCGCCGCACC CTCGGCGAGC ATGTGGAGCT GCACACCTCC
GCCCACCCGG GCCTGTGGAC GGTGCTGGCC GACGTGGGCC AGATCGAGCA GGTGCTGGTG
AACCTCGCCG TCAACGCGCG GGACGCGATG CCCGGTGGCG GCACGCTCAC CATCGACACC
TCGAACGTGA GCGTCGACGA CGAGACGGCG GCCGCCCACC CGGGGCTGTG CAGCGGGCGC
TTCGTGCACC TGCGGGTCAG CGACAGCGGG ACGGGCATGT CGCCGGAGGT CGCCGCGCGC
GCGTTCGAGC CGTTCTTCAC GACCAAGACG AAGGGCGAGG GCTCCGGGCT CGGCCTGGCC
ACCGTCTACG GGATCATCAC CCAGGCGGGC GGGCACGTGG AGATCCTGTC GACGGCCAAC
GTCGGCACCA CCGTCAGCGC GCTGCTGCCG GCCGTCGACG GCGCCGTCCC GGCGGCTGAG
GAGCAGCTCG CCGACGGTGA GCTGTTCGGC GGCGAGACGA TCATGGTGGT CGAGGACGAG
CCGGCCATGC GCGAGCTCAC CCGTCGCATC CTTGCCCGCA ATGGCTACCA GGTGATCATC
GCGGCGAGCG GGCCCGAGGC GATCTCGCTG GCGTCGACGA CGCGGGAGGA GATCCACCTG
CTGCTGACCG ACGTGGTCAT GCCGCAGCTG CTCGGCAGCG AGGTCGCCGA GCGGATCCGC
CAGCAGCGGC AGGACCTGCG CGTCCTCTAC ATGTCCGGGT ACGCCCAGCC CGTGCTTGCC
CGGAACGGGA CGCTCGCCCC GGGGCTTGCC CTGCTGACCA AGCCGTTCTC CGAGCGGGTG
CTGCTGTCGA AGGTGCGGGA GGTTCTGGAT TCCCTCAGCG GCCCTTTCTC CCCGCACGGC
GCGCCGCCCT GGCCCCGCGA CAGCTGA
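
The GC content field reports the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows. It assumes the sequence is pasted in the 10-base blocks shown above, so whitespace is stripped first. The helper name `gc_fraction` is hypothetical, and only the first line of the sequence is used for brevity, so the printed value need not match the whole-gene figure.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C bases in a sequence; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence only (not the full 2007 bp).
first_line = "TTGGCGGGCT ATGCCGACGT GCCGCCCGGC AGTCTGCTGG ACGCCGCGCC GGACGCGATC"
print(f"{gc_fraction(first_line):.0%}")
```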
 
Protein sequence
MAGYADVPPG SLLDAAPDAI VGVRPDGRIA LVNAQAERLF GYDRDELVGQ PMEILVPEAL 
RGAHPARRGA YLSDPRPRPM GAGVELAARR RDGTEFPAEI SLSAVQTVEG IFVTAAIRDV
TSRKRAEAMF RGLLEAAPDA IVGVRPDGRI ALVNAQAERL FGYDRDELVG QPMEILVPES
ARHLHPRHRT RYFDDPRPRP MGAGMQLAAR RRDGTEFPAE ISLSALETED GLLVSAAIRD
VTDRLEAQAE RERLRAQAER ERLEVQLHQS QRLESLGQLA GGVAHDFNNL LAVIINYTAF
VGEVVGTAAR DQGGRWESAR RDIEQVQRAA DRAAQLTHQL LAFGRREVVQ PRPLDLNGVV
HDMEQLLRRT LGEHVELHTS AHPGLWTVLA DVGQIEQVLV NLAVNARDAM PGGGTLTIDT
SNVSVDDETA AAHPGLCSGR FVHLRVSDSG TGMSPEVAAR AFEPFFTTKT KGEGSGLGLA
TVYGIITQAG GHVEILSTAN VGTTVSALLP AVDGAVPAAE EQLADGELFG GETIMVVEDE
PAMRELTRRI LARNGYQVII AASGPEAISL ASTTREEIHL LLTDVVMPQL LGSEVAERIR
QQRQDLRVLY MSGYAQPVLA RNGTLAPGLA LLTKPFSERV LLSKVREVLD SLSGPFSPHG
APPWPRDS
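
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates this on the first 21 bases. It assumes the codon-to-amino-acid assignments that table 11 shares with the standard genetic code, and that the initiating TTG codon is read as methionine, as the leading M of the protein sequence indicates. The function name `translate_cds` is illustrative.

```python
from itertools import product

# Standard codon assignments (shared by table 11), ordered T, C, A, G
# with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS must be non-empty and a multiple of 3 bases long")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    # Assumption: the first codon initiates translation and encodes Met,
    # even when it is an alternative start codon such as TTG.
    residues[0] = "M"
    return "".join(residues).split("*")[0]


print(translate_cds("TTGGCGGGCT ATGCCGACGT G"))  # MAGYADV
```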