Gene Acid345_3068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3068 
Symbol 
ID4071975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3645621 
End bp3647459 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content60% 
IMG OID637985087 
Productserine phosphatase 
Protein accessionYP_592143 
Protein GI94970095 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.266208 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCGCGC GCCCCCGGTC CCCGAAAGGT TTCTTCGTGA CGGCTACTGG CACCGTGCCC 
AGCGGACAGC AGGACGACCT CCTCCAGGCA TTCGCAACCG CGATTACTCA CATTACTGCG
CCGCGTTCGC AGAACGACCT CATTTCGCAG ATCAGCACGC TGATGGTGGC GAACTTTGGG
GCTGCACGCT CCGAGCTTTG GCTGCATGAC GGTAGCGACA ACCTGAACCT CGCGAGCGCG
GCGGGAGCGG CGGCCGAGCA CAACATGCTG CGGATGTCGG TGGAAGGGAA CCCGATTGGC
CAGGCGTTCA CGGAGCGCAA ACAGATTTCC GCCCAGGAGC CGACGAGCAC GCTGGCGTAT
GTCTCGGTGC ATCCGCTGGT GAATTGCAAT CAGTGCCTTG GCGTAGTGGT GAACGCGGCG
CAAAATCCGC CAAGCGATGA ACAGAGGGGC TGGTGGCAGA CCTTCGCCGA TGCGAGCGCA
ATCGCGCTGC ACCAGATGTT CGCGTCCGAA GATAGCCAGA AGACGATTAC GCAGCTGTCG
TTGCTGTTCG AAGCGACGCG ACTGCTGAAC TCGACGCTCG ATCTTGCAGA GCTGCTCGAG
TTGATCATGA AGATCGCGCG CACCGAGGTG AAGGCCGACC GGGGAAGCGT GTTCCTTGTA
GATAAGGCAC ACAAGGAGTT GTGGTCGATT GTGGCGTCGG GACTGGAGCA CCAGGAACTG
CGCGTGCCAT TCGGGCGTGG CGTGGCAGGC CGTGTGGCGG AGACCGGCGA AGTGATCAAC
GTGGCCGATG CGTACACGCT GCCGTTCTTC GATCGCAGCT TCGACCAGAA GACTGGGTAC
ACGACGAAGT CGCTGCTGTG CCTGCCGATT CGTCACCACA ATAATGAGAT CGTCGGCGTG
CTGCAGTTGC TGAACCAATC CACGCACGGA CGGTTCACCC CCGAAGACCA GGAGTTCCTT
ACCAAGCTGA CCGGCCACAT GGCTATGGCC CTGGAGAATG CACGGCTGCA CCGCGAGGCA
CTCGAGAAGC AGCGCATGGA ACGCGACCTG GCGGTGGCGA GAAACATTCA GCGCAGCCTG
TTGCCGGAAG CGCCGCCGGT GGTGCCGGGA TACGACATCG CCGTGATCAA CCACATGTGT
TACGAGGTCG GCGGCGATTA CTACGACTTT CTAAACCTGG GTCCGCAAAC GCTGTTGATC
GTTGTTGCTG ACGTCGAAGG CAAGGGCGTT AGTTCGGCGT TGGTGATGTC GAACTTGCAA
GCGACGCTGC GCGCGCTGGT GATGCACCTG CACTCGCTCG AGGTGCTGAC GATCTCGCTG
AACGAAATGA TCTTCAACGA CACCAAGTCG GAGAAGTTCC TCAGCATCTT CCTGGGGCTG
GTGGATATTC GTCGCGGTGG GTTGCATTAC ATCAACGCAG GGCACGTGCC GCCGATCCTT
GTGAAGGGCG CGACGGGCGA GTTCAAGACG CTTGAAGATG GCGGGACGGT GATCGGATTG
TTCCCGGATG CGGAGTACAA CCGAGGCTCG GCGAAGTTGG AGCCGGGGGA CATCCTGGTG
TGCTGCACCG ACGGCATTGA GGAAGCGAGC AATACTGAAG ACGAAGAGTA TGGGACCGAG
CGGCTCGCGG AGGCAGTGGC GCGGCATCGG TCGAAGCACG CGAAAGAGAT TGTAGAAGCG
GTGCTGGAAG AAGTGACAGC ATTCTCCGTC GGCGGGAAGA ACATTGACGA CAAGGTGTTG
ATGGTGATGA AGGTCACGAC CGATGGAAAG TTTGATCAGG CGAACGCGGC GGGAGAGAAA
CAGCTGGTGA AGGAGCCAGT GCTGCCGAGG CATCGGTAG
 
Protein sequence
MAARPRSPKG FFVTATGTVP SGQQDDLLQA FATAITHITA PRSQNDLISQ ISTLMVANFG 
AARSELWLHD GSDNLNLASA AGAAAEHNML RMSVEGNPIG QAFTERKQIS AQEPTSTLAY
VSVHPLVNCN QCLGVVVNAA QNPPSDEQRG WWQTFADASA IALHQMFASE DSQKTITQLS
LLFEATRLLN STLDLAELLE LIMKIARTEV KADRGSVFLV DKAHKELWSI VASGLEHQEL
RVPFGRGVAG RVAETGEVIN VADAYTLPFF DRSFDQKTGY TTKSLLCLPI RHHNNEIVGV
LQLLNQSTHG RFTPEDQEFL TKLTGHMAMA LENARLHREA LEKQRMERDL AVARNIQRSL
LPEAPPVVPG YDIAVINHMC YEVGGDYYDF LNLGPQTLLI VVADVEGKGV SSALVMSNLQ
ATLRALVMHL HSLEVLTISL NEMIFNDTKS EKFLSIFLGL VDIRRGGLHY INAGHVPPIL
VKGATGEFKT LEDGGTVIGL FPDAEYNRGS AKLEPGDILV CCTDGIEEAS NTEDEEYGTE
RLAEAVARHR SKHAKEIVEA VLEEVTAFSV GGKNIDDKVL MVMKVTTDGK FDQANAAGEK
QLVKEPVLPR HR