Gene Acid345_3294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3294 
Symbol 
ID4072706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3900533 
End bp3902077 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content56% 
IMG OID637985315 
Producthypothetical protein 
Protein accessionYP_592369 
Protein GI94970321 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2982] Uncharacterized protein involved in outer membrane biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTCAC TTCCTACCCT CACCAAAGCC AAGTGGCTTC TGCTGTGTTG CGCCGTAATT 
CTGCTCGTGA CCGCAGGAAC GGCATTGTAT GTGGCCCCAC GATGGCCCTA TCGCAAAGCA
GAACTCATCA GACGACTCGA AGATGCGACC GGCTCTCGAG TCGAGATTGG CCGGTTCCAT
CAGACGTTCT TCCCGCATCC CGGTTGCGTG GCGGAGAACC TGACGCTCAC AAACTCCGAT
CCTGCTCACC CGAAAATTAC AGTGCAGAAA GTGACGGTCA GCGGCCTCTA TTCCGGTTTG
ATCGGGAGCG AGAAGCATAT TCGGATGCTG AGCCTCGAAG GCATGCGCAT CGCTTTCCCT
CCCCAGAATC CAGAATCATC GGGAGGCCAA AACGACGTGC GGCCGCAGAG GGTTGGGGTG
CCAAGTGGCA TCAAATTTGA CCAGGTCGAA GCGCGCAATT CGCAGATTGT TTTCGCTCGT
AAGAACCCAG AGAAAGAACC GCGAACGTTT GCTATTTACG ACGTAATCCT GAAATCATTC
GCGCCCGGTG AGGCAATCCA CTTCAACGCG GTGTTGCGGA TTCCAGTGCC TCCTGCCGAA
GTGAACGTCG ATGGAGTGTT CGGGCCTCTC ACCGGCGATG CGATCGGCAA GGCACAATTG
AAAGGTCGCT TTACGATGAA GAACGCGGAC TTATCCAAGT TCCACGCGTT GATGGGAACG
CTTTCCGCTC AAGGTCAATT TGAAGGCCAG TTAGAGGGCC TGACCGTCAA TGCGACTACG
AATGTTCCGG ACTTCGGCAC GAAGGCAACC CATCACACCA TCCCCCTGAA TACTGACTTC
ACGGCGGTGG TTGATGGAAC CAATGGCGAC GTCCAGTTTC AGCCGGTGCA TGCGCTTCTC
GGGGAAACGA AATTGATCGC GACGGGCAAA CTCGAGGAAG TACCGGATGC GAACGGGAAA
CGCCTGACCC TCCATATCGC TTCACAGGAG GCGCGCATCC AGGATCTGAT GGTTCTGTTC
ACACATTCGA AGCCGCCATT ACAAGGGGCG ACGCGCTTCG ACATGAGCGT GCAATTACCT
CCAGATAAGA AGCCCTTTGA AGAACGACTC CAGGCCACGG CGCATTTCGG CATCCGTGGC
TCGAAATTCA GCAAACAACA AACGGAACAA AAAGTGAGCG ATCTCAGTCA ACGAGCCCAG
GGAAATACAA AAGACGACGA TCCCCCTCCC GTAATGACGG ACCTTTCAGG GGATGTGAAC
CTCGCCGGCG GAACCGCGAA CTTCTCCCGC CTCGACATCA GTATCCCCGG GGCGAGCGCT
GCCCTGCATG GCACATACAA ACTGGAGACC CACGCCGTGG ACCTGCACGG AATGCTTCAC
ACCGACGCGA ACCTTTCGGA CGCAACGACG GGTTTCAAGG CGTTCCTCAT GAAGGTCGTA
GAACTGGCGA AGAAGAAGAA CAAGAATGGC GCAACAGTCC CGGTGAAGAT TACCGGATCG
TATGAGAGAC CTGATTTCGG ATTAGACGCC CCTGCGGAAA AATGA
 
Protein sequence
MRSLPTLTKA KWLLLCCAVI LLVTAGTALY VAPRWPYRKA ELIRRLEDAT GSRVEIGRFH 
QTFFPHPGCV AENLTLTNSD PAHPKITVQK VTVSGLYSGL IGSEKHIRML SLEGMRIAFP
PQNPESSGGQ NDVRPQRVGV PSGIKFDQVE ARNSQIVFAR KNPEKEPRTF AIYDVILKSF
APGEAIHFNA VLRIPVPPAE VNVDGVFGPL TGDAIGKAQL KGRFTMKNAD LSKFHALMGT
LSAQGQFEGQ LEGLTVNATT NVPDFGTKAT HHTIPLNTDF TAVVDGTNGD VQFQPVHALL
GETKLIATGK LEEVPDANGK RLTLHIASQE ARIQDLMVLF THSKPPLQGA TRFDMSVQLP
PDKKPFEERL QATAHFGIRG SKFSKQQTEQ KVSDLSQRAQ GNTKDDDPPP VMTDLSGDVN
LAGGTANFSR LDISIPGASA ALHGTYKLET HAVDLHGMLH TDANLSDATT GFKAFLMKVV
ELAKKKNKNG ATVPVKITGS YERPDFGLDA PAEK