Gene Acid345_2055 details


Gene Information

Locus tag: Acid345_2055
Symbol:
ID: 4070597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2464812
End bp: 2466575
Gene Length: 1764 bp
Protein Length: 587 aa
Translation table: 11
GC content: 57%
IMG OID: 637984069
Product: hypothetical protein
Protein accession: YP_591130
Protein GI: 94969082
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2982] Uncharacterized protein involved in outer membrane biogenesis
TIGRFAM ID:
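
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2464812
END_BP = 2466575
GENE_LENGTH_BP = 1764
PROTEIN_LENGTH_AA = 587


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the untranslated stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the record: the span covers 1764 bp, which is 588 codons, or 587 residues plus a stop codon.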


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.429866
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGTGG TTAAGCGCGG GCATCGTGAA AGCCAAAAAG AAGGCGCATT TAGAAACGAA 
GCGAGACTGT CACTTTCTCC AGTTTCGCGA AGTCAGCCAT TGGGACTAAA GTGGCGGGCG
TCGACGTGCG CCCCGCCGAA CAGGAGCTCA GCCATGAACA AGCGCGTACT GAAAATTGCA
GGCATCGTCG TTGGCGTCAT CCTGGTTATT CTGATCGCGA TTCCGTTGTT CGTGAATGTC
GAGACATTCC GTCCAAAGAT TGAAGCGTCG TTGAGCGAAG CGCTCGGCAG AGAAGTGAAG
CTGGGGAAGA TGAGCCTCTC CATCTTCAGC GGCAGCCTGA GCGTGGACAA CATCTCGATT
GCAGACGATC CGGCGTTCAG CAAGGCACCG TTCGTCAGCG CGAAATCATT GAAGATTGGC
GTGGAATTGG GAGCGCTGAT CTTCTCCAAG CAGATCAAGG TCACTGGCTT CAAGTTGGAG
AAGCCGGAGA TCATGCTGCT CAGCGCGCCG AACGGGACGT GGAATTTCTC CACGCTGGGG
GCAAAGAACG CACCGAAGAC AAGCCAGGCT TCGTCGGGCG CGCCGTCGGA CGTCACGATC
GCGCACCTAT CGATTGATGA CGGCAAACTG ACCGTAGCGA AGGCGAATTC GTCGGCGAAG
GCACAGGTGT TCGACAAACT GGATGTGAAG GTCAACGATT TCTCGATGGC CTCACAATTT
CCGTTCGAGG TAAGCGTCGA CTTGCCGGGC GGCGGCGATG CGAAGATCAC GGGAAAGGCC
GGGCCGGTCA ACCAGCAGGA CGCAGCGAAG ACGCCGTTCG ATGCGAAGCT CAAGGTCAAC
AAGCTCGATG TGGCAGCATC GGGATTTGTG GATGCGTCCA CAGGTATTGG CGGGATTGTT
GGGCTGGAAG GAACACTGAG TTCCAACGGC ACGCAGGCGA AGGGTGCGGG TGAAGTCACG
ATGGCGAAGG CGAAACTATC GCCAAAGGGA ACACCTGCGC CGAAAGACAT CACGCTGAAG
TACGCACTCA CCGCGGATTT GGAGAAGCAG AGCGGAAACG TATCTCAAGG CGATATCGGG
ATCGGCAAAG CTTCAGCGCG GCTAACTGGC GTCTACAGCA CTCACGGCGA CGTGCAAAGC
CTGAACATGA ACCTGAATGC GCCAAACATG CCGGTGGATG AGCTCGAAGC GTTCTTGCCG
GCGATGGGTG TAACACTACC CTCGGGGTCG AAGCTGGAGG GCGGAACACT TTCGGCGGAT
TTGGCAATCA ACGGGCCGCT CGACAAGTTA GTGATCACTG GTCCTGTGAA GCTGAGCAAT
ACGAAGCTGG CCGGATTCAG TGCGACGTCG AAGCTCAGCG CGCTGTCGGC ATTTGGCGGC
AAAGTTCCGC AAAGTCCGGA TACAACGATC CAGAATGCGA GCTTGAATGC GCGGGTGGCT
CCGGAAGCGA CCAAGTTGGA CGCGATTAAC GTGGTGGCGC CGTCGTTGGG AACGGTAACG
GGTGCGGGCA CCATCAGTCC GGCGGGAGCA TTGGATTTCA AGATGCAAGC TGATATGAAA
GCGACGGGAG CGATTGCACA ACAGGCAGGG GTCGGTGGGA TGAATGGACC GATTCCGTTC
AGCATCCAGG GGACGACTTC GAATCCAACG TTCGTGCCGG ACGCGGGCGC GATTGCATCG
AGCGTTGCGA AGAGCGCGAT CCAGAAGCAA CTGGGGGACA AGTCGGGAAT TACGAGCTTG
TTCGGCAAGA AGAAATCGAA ATAG
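
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be compared with the fields in the record. A minimal sketch follows, assuming the sequence is pasted in as a plain string. The helper names are hypothetical, and the GC percentage here is a simple (G + C) / total ratio, which may not be exactly how the database computed its rounded value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Only the first line of the gene sequence is used here for brevity;
    # paste the full block to compare against the record's fields.
    first_line = "ATGTTCGTGG TTAAGCGCGG GCATCGTGAA AGCCAAAAAG AAGGCGCATT TAGAAACGAA"
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_percent(seq), 1))
```

Running the same two functions on the full block gives the values to compare with the Gene Length and GC content fields.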
 
Protein sequence
MFVVKRGHRE SQKEGAFRNE ARLSLSPVSR SQPLGLKWRA STCAPPNRSS AMNKRVLKIA 
GIVVGVILVI LIAIPLFVNV ETFRPKIEAS LSEALGREVK LGKMSLSIFS GSLSVDNISI
ADDPAFSKAP FVSAKSLKIG VELGALIFSK QIKVTGFKLE KPEIMLLSAP NGTWNFSTLG
AKNAPKTSQA SSGAPSDVTI AHLSIDDGKL TVAKANSSAK AQVFDKLDVK VNDFSMASQF
PFEVSVDLPG GGDAKITGKA GPVNQQDAAK TPFDAKLKVN KLDVAASGFV DASTGIGGIV
GLEGTLSSNG TQAKGAGEVT MAKAKLSPKG TPAPKDITLK YALTADLEKQ SGNVSQGDIG
IGKASARLTG VYSTHGDVQS LNMNLNAPNM PVDELEAFLP AMGVTLPSGS KLEGGTLSAD
LAINGPLDKL VITGPVKLSN TKLAGFSATS KLSALSAFGG KVPQSPDTTI QNASLNARVA
PEATKLDAIN VVAPSLGTVT GAGTISPAGA LDFKMQADMK ATGAIAQQAG VGGMNGPIPF
SIQGTTSNPT FVPDAGAIAS SVAKSAIQKQ LGDKSGITSL FGKKKSK