Gene Acid345_3619 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3619 
Symbol 
ID4070139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4280102 
End bp4282147 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content63% 
IMG OID637985642 
Productcarbon starvation protein CstA 
Protein accessionYP_592694 
Protein GI94970646 
COG category[T] Signal transduction mechanisms 
COG ID[COG1966] Carbon starvation protein, predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.186111 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.595383 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAACC GCGTGGTTCG TGCCCTCATC TGGGCCGCTG TCATCGTCGT GGGCGCGCTC 
TCCATCGCCA CCATCGCACT GCGGCGCGGC GAGTCCATCA ACGCGATGTG GCTGGTCGTG
GCCGCGCTCT GCACCTACGC CGTCGGTTAC CGTTTCTATA GCAAGTTCAT CGCGACCAAA
GTCCTCGTCC TCGACAGCCG TCGAGCCACG CCAGCCGAGC GCCTCGATAA CGGACGCGAT
TTCGTTCCCA CCAACAAATG GGTTGTCTTC GGACACCACT TCGCCGCCAT CGCCGGTCCC
GGCCCGCTCG TCGGCCCGGT ACTCGCGGCG CAATTCGGCT ATCTTCCCGG AACTCTATGG
ATCCTCGTTG GCGCCGTCCT CGGCGGATGC GTGCAGGACT TCGTCACTCT CGTCTTTTCC
GTGCGTCGCG ACGGCAAATC GCTAACCGAA ATGGCCCGCG AAGAGATCGG CAAAGTCGGT
GGCTTCGTGG CATTTTTCGC GGTCGTCGCG ATCATCATCA TCCTGCTCGG CGCGGTTGCG
CTCATCGTCG TCAACGCGCT GAAGGGAAGC CCGTGGGGCA CCTTCACCAT CGGCATGACG
ATCCCGATCG CGCTGCTGAT GGGCCTATAC CTGCGCCGTA TTCGTCCGGG CAAAGTGATG
GAAGCCAGCG TTCTCGGCTT CATCTGCGTA ATGGCTGCGA TCTTTGGCGG ACAATGGGTC
TCGCACACCG CGTACGCCGG ATGGTTCACC TACTCCGCGA CGACGCTCGC CATCGCGATC
ATCATTTACG GCTTTGCCGC CTCGGCGTTG CCGGTCTGGC TGCTGCTTGC GCCGCGTGAT
TACTTGAGCA CCTTCGTGAA ACTCGGCACC ATCGCGATCC TCGCGGTCGG TATCGTGATT
TCGCGGCCGC AACTGCACAT GCCCGCGCTT ACTCGCTTTA TTGACGGCAC TGGGCCGGTC
TTCGCCGGAA ATCTCTTCCC GTTCGCGTTT ATCACCATCG CCTGCGGCGC AATCAGCGGA
TTTCATGCGC TGATCTCGAG CGGCACCACG CCCAAGCTAA TCCAGCGCGA GACGGAAACG
CGCCTCGTCG GCTACGGCGC GATGCTCTGC GAATCGCTGG TTGCAATCAT GGCGACGATC
GCCGCGTGCG TGATCCAGCC CGGCGTGTAC TTCGCGGTGA ATAGCCCGGC GGGCATCGTC
GGCGCCAACC CTGCCGATGC CGCCGCCAAA ATCACTTCCT GGGGATTCCC GGTGGACGCC
GCGCAGATGG CGCAACTCGC GCACGCCGTC GGCGAGCAGA CGCTCTTCAA CCGCACCGGC
GGCGCGCCGG CATTCGCTGT CGGCATGGCG CACATCTTCG CGCACTCGCT CGGTGGCGAG
GCGGTAATGG CGATCTGGTA CCACTTCGCC ATCATGTTCG AGGCGCTTTT CATCCTCACC
GTGCTCGACG CCGGCACCCG CGTTGGGCGC TTCATGCTGC AAGACGCGCT CGGCCACGTG
TGGAAGCCGC TTGGCCGCAC CAGTTCTTAC CCGAGCATCG TGCTGACGTC CGCGATCATC
GTCGGTCTGT GGGGATACTT CCTCTGGCAG GGCGTGAAGG ACCCGCTTGG CGGCATCAAC
TCCATGTGGC CACTCTTCGG CATCTCCAAC CAATTACTTG CTGCGGTCGC GCTTTGCGTT
GCCACCACCA TCGTCATCAA GATGGGCCGC GCCCGCTACG CCTTCGTAAC GCTGGTGCCG
CTGATCGTTT TGGTCGCGAT CACCTTTGGC GCAGCGTACG AAAAAGTCCT CAACCCGAAT
CCGCGAATCG GCTTCCTTGC CCACGCCCGT CAACTCGCGT CGCAGCCCAA CATATCGCAT
CAGACGTCGC AACTGATCTT CAATGACCGC CTCGATGCTG CCATCACCAC GGTGCTCGTC
TTCCTCGTAA GCCTGGTCGT GATCGAATCC ATCATCGAGT GGGTGCGCGT GCTGAGCGGT
CGAAAAGCCG CGACTGTCCG CGAAGCTCCG TTTGTCGCCT CGCACTTCGC GGAGGAACAA
GCATGA
 
Protein sequence
MRNRVVRALI WAAVIVVGAL SIATIALRRG ESINAMWLVV AALCTYAVGY RFYSKFIATK 
VLVLDSRRAT PAERLDNGRD FVPTNKWVVF GHHFAAIAGP GPLVGPVLAA QFGYLPGTLW
ILVGAVLGGC VQDFVTLVFS VRRDGKSLTE MAREEIGKVG GFVAFFAVVA IIIILLGAVA
LIVVNALKGS PWGTFTIGMT IPIALLMGLY LRRIRPGKVM EASVLGFICV MAAIFGGQWV
SHTAYAGWFT YSATTLAIAI IIYGFAASAL PVWLLLAPRD YLSTFVKLGT IAILAVGIVI
SRPQLHMPAL TRFIDGTGPV FAGNLFPFAF ITIACGAISG FHALISSGTT PKLIQRETET
RLVGYGAMLC ESLVAIMATI AACVIQPGVY FAVNSPAGIV GANPADAAAK ITSWGFPVDA
AQMAQLAHAV GEQTLFNRTG GAPAFAVGMA HIFAHSLGGE AVMAIWYHFA IMFEALFILT
VLDAGTRVGR FMLQDALGHV WKPLGRTSSY PSIVLTSAII VGLWGYFLWQ GVKDPLGGIN
SMWPLFGISN QLLAAVALCV ATTIVIKMGR ARYAFVTLVP LIVLVAITFG AAYEKVLNPN
PRIGFLAHAR QLASQPNISH QTSQLIFNDR LDAAITTVLV FLVSLVVIES IIEWVRVLSG
RKAATVREAP FVASHFAEEQ A