Gene Acid345_3357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3357 
Symbol 
ID4071275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3981390 
End bp3984302 
Gene Length2913 bp 
Protein Length970 aa 
Translation table11 
GC content60% 
IMG OID637985379 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_592432 
Protein GI94970384 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.984464 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAGTG AGATGGCAGT AGATTTGCAC ATGGATTCGC CGGAGATTAC CGTCCTTCCC 
ACCAGCTCTG GCAAAGGACG GAACCGGCCC ATGCTGAAGG AGAATCAAGT CCGCTTCGTT
GCCGCCGTTG TGGCGCTGCT GACCGCGACT GCAATCGTAT TCTCCTTCAT TAACTTCCAG
AAAGAGCGGG AGTTCGAGAC CCCTACCGAC GGCGTCTGGT GGGTGGAATC CGGCGGCCAT
CTCAAGGCCG AAAAGGTCGA GGCCGACGGC CCCGGCGAGA AGGCCGGTAT CAAGCAGGGT
GACGTCCTGA TCGCCATCAA CGGCGTCGAT ATTACCCGCA AAGCCGTGCA AGTGCGGCAG
ATTTACCGCA CCGGAAGCTG GTCAAAGGCT ACTTTCTCGC TGACCCGCAG CAATGTGCCG
ATCGAAGTCC CGGTGATTCT CACTCCGGCC GATCGCTCCC TGAATGGCGG CTTGCGCCTC
ATCGCGCTGA TTTACCTTGG CATCGGCATC TACGTTCTGT TCCGGCGCTG GACGGCCCCA
AAGGCGACCC ATTTCTACCT GTTCTGCCTG GCGTCGTTCG TTTACTACTC GTTCCACTTC
ACCGGTAAGC TGAACCAGTT CGACTGGATC ATCTACTGGT CAAACGTGAC CGCATGGCTG
TTACAGCCGG CGCTCTTCCT GCATTTCGCG CTCACATTCC CCGAGACCAA AGACGTAGTA
AAGCGACACA AGTGGCTGGT GCCTGCTGTC TATGCGGTGC CCGCCGCGCT GCTTTCGCTG
CATATCGTGG CGCTGAATTT CCTGCGCCCC AGTGAAGTCC TGCGCTGGAA CCTCGACCGC
GGCCAGATGC TGTATCTCGC GGTGTATTTC GTGGCTGCGA CCGTGGTGCT CTGGCAGACC
TACGCGAATG CCGCCTCGCC CATCCTGCGT CAGCAGATGA AGTGGGTGAC CCGCGGCACG
TTCATGGCGA TCGCGCCGTT CACCATCTTC TACGTCATCC CGTACCTGCG CGGCAGCCTG
CCGACCGCGG CCATGAAGAT CTCGGTGCTC TCACTGATCT TCCTCCCGTT GACCTTCGGC
TACGCCATCT TCCGCTATCG CCTGATGGAC GTGGACCTCA TCTTCAAGCG CGGCATGGCC
TACACACTCG CCGCCGGCAC GATCACCGGT ATTTACTTCA TGGCGATCGG CGGGGCCTCG
GAAATGTTCC ACAAGAACTT CCCGAGCGCC GGCCCAGCCG GATTGATGGC GGCGATCGTC
GTCACGGCCT TGCTCTTCGA TCCGTTCAAG AACTGGATCC AGGACAAGCT CGACAAGTTC
TTCTATCGCA AGCGGTATGA CTACCGTAAG ACGCTCATCG AATTCGGCCG CGACCTGAAC
TCCGAAACCG ATCTCGACAA GATGCTGGCG TCCATTGTGG ACCGCCTCTC GCGCACGCTG
CTCGTCGATC GCATCGCCGT CTTCGTGCAC GATGAACAAA GTCGCTGGGT GCTGGCAAAG
AGCTGCGGCA TCTCGCAGAC CACCGGGCTC GACATGAGCT TCATGAACGA AGAGCGTCCG
GACATGGCTG CAGCCGGGCA CCTGTTCTTC GACAACACCA GCCAGGCAGT TCGCGAAAAT
CCCGGCGCGC GCGAGACCAT TCGCCGCCTC GATCTGAACT ACTATCTGCC TTGCACGGTG
ATGGGCCGCA CCATCGCGAT GGTCGGCCTT GGCAAGACCA CTGAGGGCGA CTTCCTCTCC
AGCGAAGATG TAGAGCTGCT CGAGACGCTG GCCGGCTACA TCGGCATCGC GCTGCAGAAC
GCGCGCCTCT ACCAGTCGCT CGCAGAAAAA ATCACCGAGT ACGAGCGGCT CAAGGAATTC
AACGAGAACA TTGTCGAATC CGTAAGCGTC GGTGTGCTCG CCGTCGATCT CGAAGACAAG
ATCGAATCGT GGAATGCGCA GATGGAAGTG ATGTACGCGC TGCCCCGCGC GGAGGCCCTC
GGCAAGCGCC TCTCCGATGT CTTCCCGCTG AATTTCGTGG AAGAGTTCTA TCGCGTCCGC
CAGGTTCCCG GCATTAACAA TCTCTATAAG TTCCGCATGG GCACTCCCGC GGGCGACACG
CGCATTTGTA ACATCGCTAT CGCACCGCTG GTCACGCGCG ATTTCAACGT CATCGGCCGC
ATCATCATCC TCGACGACAT GACCGATCGC GTCGAACTCG AATCGCAATT GGCGCAGGCA
GAAAAGCTTT CGTCCATTGG ATTGTTGGCC GCCGGCGTGG CGCACGAAGT CAATACGCCA
CTTGCGGTGA TCTCGTCCTA CGCGCAGATG CTCTCCAAGC AGTTGCAGGG CGATGAACGC
CGCTCGGCGC TGCTCGAGAA GATCACCACA CAGACGTTCC GCGCCTCGGA GATCGTTAAC
AACCTGCTGA ACTTCTCCCG CACCGGCAGC AGCGAATTTG CTGAAGTAGA CATCAACAAG
GTTGTCAGCG ACACGCTCGC CTTACTCGAG CACCAGCTAA AGACCTCGCG CGTGAAGGTG
GAAAACCACC TCGCGCCTAC GCTGCCCAAG ATCTACGGCA ACACCGGCAA GTTGCAGCAA
GTGTTCCTGA ACCTCTTCCT CAACGCCAAG GATGCGATGC CCTCGGGCGG CACGCTGAGC
ATCACCACGC GCAACGGCCG CGCGGTTGAG GTTGAGGTAT GCGACACCGG AAGCGGCATT
GCGCCGGAAC ACATCCAACG CATCTACGAT CCATTTTTCA CCACGAAGAA ATCGCCGCGG
CAGGGCCACT CCGGAGGCAC CGGCCTCGGA TTGGCTGTGA CCTACGGCAT TATCCAGGAA
CATGCGGGCA AGATCCGCGT GGACAGCCGC CCCGGCGAGG GCACGCAGTT CACGATGGAG
TTCCCCATGG TCAGGAAGGC TGTGAATGCC TGA
 
Protein sequence
MQSEMAVDLH MDSPEITVLP TSSGKGRNRP MLKENQVRFV AAVVALLTAT AIVFSFINFQ 
KEREFETPTD GVWWVESGGH LKAEKVEADG PGEKAGIKQG DVLIAINGVD ITRKAVQVRQ
IYRTGSWSKA TFSLTRSNVP IEVPVILTPA DRSLNGGLRL IALIYLGIGI YVLFRRWTAP
KATHFYLFCL ASFVYYSFHF TGKLNQFDWI IYWSNVTAWL LQPALFLHFA LTFPETKDVV
KRHKWLVPAV YAVPAALLSL HIVALNFLRP SEVLRWNLDR GQMLYLAVYF VAATVVLWQT
YANAASPILR QQMKWVTRGT FMAIAPFTIF YVIPYLRGSL PTAAMKISVL SLIFLPLTFG
YAIFRYRLMD VDLIFKRGMA YTLAAGTITG IYFMAIGGAS EMFHKNFPSA GPAGLMAAIV
VTALLFDPFK NWIQDKLDKF FYRKRYDYRK TLIEFGRDLN SETDLDKMLA SIVDRLSRTL
LVDRIAVFVH DEQSRWVLAK SCGISQTTGL DMSFMNEERP DMAAAGHLFF DNTSQAVREN
PGARETIRRL DLNYYLPCTV MGRTIAMVGL GKTTEGDFLS SEDVELLETL AGYIGIALQN
ARLYQSLAEK ITEYERLKEF NENIVESVSV GVLAVDLEDK IESWNAQMEV MYALPRAEAL
GKRLSDVFPL NFVEEFYRVR QVPGINNLYK FRMGTPAGDT RICNIAIAPL VTRDFNVIGR
IIILDDMTDR VELESQLAQA EKLSSIGLLA AGVAHEVNTP LAVISSYAQM LSKQLQGDER
RSALLEKITT QTFRASEIVN NLLNFSRTGS SEFAEVDINK VVSDTLALLE HQLKTSRVKV
ENHLAPTLPK IYGNTGKLQQ VFLNLFLNAK DAMPSGGTLS ITTRNGRAVE VEVCDTGSGI
APEHIQRIYD PFFTTKKSPR QGHSGGTGLG LAVTYGIIQE HAGKIRVDSR PGEGTQFTME
FPMVRKAVNA