Gene Franean1_4980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4980 
Symbol 
ID5673319 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5975058 
End bp5976953 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content75% 
IMG OID641243834 
ProductGAF sensor signal transduction histidine kinase 
Protein accessionYP_001509250 
Protein GI158316742 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.691708 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.877049 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCAGA CCGGCGCCGG CCCAGATGCG CCGGCGACCA GCAGCGCCGA GCCGCGGACG 
CTCGGCGGCG CCGGCCGTCG CCTCAAGCCC GGGCCGAGGT CGGAGGTCTA CGCGGTGCAG
AGCCGGATGC GCGGCCTGCT CGACGCCGTG GTCGATGTCG CGCGCGAGCT GAGCCTGCCA
GTGACGCTAC GCCGGATCGC GCAGGCGGCC CGCTCCCTGG TGGACTGCGA GCTCGGCGCG
CTCGGGGTCC TCGGCCAGGA CGGCGCCATC ACCGAGCTGA TCGCGGTCGG CCCGGGGGAG
GAGGTTCCCC GCGACATCGC CCGGATGCCG CCCGGCCGCG GCCTGATCGG CGAGCCCGGG
GGCGGCCCCC ACCCGGGCGG GAACGCACCG CGGGTGGGCA CCGCCGTCGT CGCGCCGGAG
ACGCTCGGCT TCCCGCCCGG CCGGCCTCGT TTCACCACGT TCCTCAACGT CCCGATCGCC
GTCCGCGGGG AGGCGTTCGG CAACCTGTAC CTGGTGGGCA AGCGCGGCCC CGAGTTCACC
CAGGAGGACG AGGACCTGGT CGAGGCGCTC GCCGCGGCCG TCGGCTTCGC CATCGAGAAC
GCCCGCCTTT ACGAGGCCAC CCGGCGGCGC CAGGCCTGGC TCACCGCGAG TGCCGAGATC
ACCACGGCGC TGCTGTCGGT GGCCGAGCCC GTCGACGCGT TACGGCTGGT CGCCCGGCGT
GCCAGGCAGA TCACCTCGGC CCTGCTCGCC GCGATCGTGC TCCCGGTGAA CAGCGCGGAG
GCGGCCCGGC CCGGCCGCTT CGGCGTCCGG CCCCGCCGGC CCGCCGCGCG GGCCGGCGGG
GCCCCGAGGT CGCTGGAGGT GGCCGTCGTC GACGGTCCGC TCGCCGAGCA GTTGCGGGGG
CGGATGCTGC CCGAGCGGGT CGGGCTGTTC AAGATCATGA AGGCGGGCCG GGCGGTGCTC
GTCCCCGCCG AGCGCGCCGA TCCCGCCGCG CACAGCCTGC TGGGCGAGGC CGTCGACGGC
CTGGTGATCG GCACCGTGAT GGTCATCCCG CTGTTGGCCG CCGGGCGGCC GCTGGGGGTG
CTCATGCTCA CCGCGGCCCC GGGCGCGGTC CCGTTCGGCC AGCTCGACCT GGAGATGGCG
GCGGCGTTCG CCGGTCAGGC CGCGCTCGCC CTGGAACTCG CCCGCGTGCA GTGGGACAGG
GAACGCCTCG CGGTGTTCGA GGAACGCGAC CGCATCGCCC GCGACCTGCA CGACGTCGTC
ATCCAGCGGC TGTTCGCCAC CGGTCTGCAG ATGCAGGGCC TCGCCCGGGT GATCGACGAG
GGCGCCGCGG TGCGGCTCAA CGACGCCGTC CGCGAGCTGG ATCAGACGAT CGCCGACATC
CGCCATACGA TTTTCTCGCT GACGGCGTCC GCCGGCGCCG TCGACCTGCG GGCCGAGATC
GCCGGAATCG TGTCACAGGC CGAGCAGGCG CTGGGCATCC GGCCCACGGC CCGCATCGAC
GGCCCGGTCG ACCGCGGTAT CCCGGAGGTG ATCCATCCGC ACCTGCTGGC GGCCATCCGT
GAGGCGCTGT CGAACATCGC ACGGCATGCC CGGGCGACCC GCATCGAGGT GCTGGTGCGG
GTCACCAACA CCGACGTCTC GGTGCAGGTG CGCGACGACG GCTGCGGCCC GGGTGGCGCG
TCGCGCAGCA GCGGCCTGAC GAACCTGCGC CGCCGGGCGC TCGACCTCGG CGGCCGGATG
GAGTTCGGCC CAGGCGAAGA CGGCATCGGC ACGACGGTGA CCTGGTACGT GCCGCTGGTT
CAGCCCATCC CGCCACCGCG CGCGCTGCCG CGGGCCGGGG ACACCCCCGC CCCGCGACTC
GGCGTCGGGC CCGCGGAGGG GCCTGGTGGG CCCTGA
 
Protein sequence
MVQTGAGPDA PATSSAEPRT LGGAGRRLKP GPRSEVYAVQ SRMRGLLDAV VDVARELSLP 
VTLRRIAQAA RSLVDCELGA LGVLGQDGAI TELIAVGPGE EVPRDIARMP PGRGLIGEPG
GGPHPGGNAP RVGTAVVAPE TLGFPPGRPR FTTFLNVPIA VRGEAFGNLY LVGKRGPEFT
QEDEDLVEAL AAAVGFAIEN ARLYEATRRR QAWLTASAEI TTALLSVAEP VDALRLVARR
ARQITSALLA AIVLPVNSAE AARPGRFGVR PRRPAARAGG APRSLEVAVV DGPLAEQLRG
RMLPERVGLF KIMKAGRAVL VPAERADPAA HSLLGEAVDG LVIGTVMVIP LLAAGRPLGV
LMLTAAPGAV PFGQLDLEMA AAFAGQAALA LELARVQWDR ERLAVFEERD RIARDLHDVV
IQRLFATGLQ MQGLARVIDE GAAVRLNDAV RELDQTIADI RHTIFSLTAS AGAVDLRAEI
AGIVSQAEQA LGIRPTARID GPVDRGIPEV IHPHLLAAIR EALSNIARHA RATRIEVLVR
VTNTDVSVQV RDDGCGPGGA SRSSGLTNLR RRALDLGGRM EFGPGEDGIG TTVTWYVPLV
QPIPPPRALP RAGDTPAPRL GVGPAEGPGG P