Gene Acid345_1899 details


Gene Information

Locus tag: Acid345_1899
Symbol:
ID: 4073361
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2279703
End bp: 2281388
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 59%
IMG OID: 637983909
Product: hypothetical protein
Protein accession: YP_590974
Protein GI: 94968926
COG category: [S] Function unknown
COG ID: [COG2989] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
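
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the gene is a stop codon (the gene sequence in this record ends in TAG). The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2279703
END_BP = 2281388
GENE_LENGTH_BP = 1686
PROTEIN_LENGTH_AA = 561


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks should agree with the values listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2281388 - 2279703 + 1 gives 1686 bp, and 1686 / 3 - 1 gives 561 aa, which matches the listed gene and protein lengths.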


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.131383
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000892254
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGCCACTAC GTCGCCGATT CACGGCCTTC GTGTACCTGC TGATGTTTGG CCTGCTCCTC 
CCGCACGTCG CGCACGCGCA GAAGAAGCCT GCATCCGCGA AGCCCGCTGG ATCCCCCGCT
CTCCAATCTG CCATTCGCGC CGGACAGCTC CCCGACATGC GTTGGCCCAA CTTCTCCGAC
TATCGCGTCC AGATCGATAA CTTCTACAAG GCGTCGAACT ACTCTCTCGC GTGGATTGAA
GCTGGCACGC CGACCGACCA CGCCCGCCAG ATGATCGCGA TCCTCAGCGC TGCCGACTCA
CAAGGTCTCA ACGCCGAAGA TTACGACGGA CTCCGCTGGC CTGACCGCAT CTCGAAGCTC
GCTGCTGCAC ACGCGCCTGA AGACGAGGAC GTCTTCGATC TCGCGCTCAC CGTCAGCACC
ATGCGTTACA TATCCGACAT GCATATCGGC CGCATCAATC CCACCCACTT CCAGTTCGGC
CTCGATGTGG AACACAAGAA GCTCGATCTC CCCAGCTTCG TACGCAACAT GCTCAGTTCA
CCTGACGACC TCACGCACAC AATCGCCAAG GTAGGCCCAC CGTTCGCCGG ATACGAAGCC
ACCCGCCAGG CCATGCTGCA GTACACCCAA CTAGCCAAGC AGCCCGATAC CGAGAAACTC
CCGCTTCCGG TCGGCGTGGT GTACCAGGGC GGCTACTACG ACCACATGCC CGCCCTCGCC
AAGCGCCTCC AGCAACTTGG TGACCTCGAT CCCAAAGTCA TCATCCTGGC GGATGCGATC
AAATACGACG ACCCTCTCAT GGGCGGCGTC GCGCACTTCC AGTCGCGTCA CGGCCTGCCC
AATGACGGCA ATCTCACCTC CGACACGATC GATGCGCTGA ACATACCCAT CGCCGATCGC
CTCGAGCAGC TAAAGCTCGC GCTCGAGCGC TATCGCTGGA TCCGCTATCA ATTCACTTCA
CCTCCTGTCG TGGTCAACGT GCCGGAGTTC AAGCTCTTCG GCTATGACGG AAGCGGCACG
CAGATCCTAT CCATGGGGGT GAATGTTGGC GACGCCTTCG ATTTTCAGAC GCCTATCTTC
GAAGGTGACA TCCGCTATAT CGTCTTCCGG CCCTATTGGT ACGTGACGCC CACGATCCAG
CGCGACGAGA TGGTGCCCTC TGTCGAAGAA GACCGCACCT ATCTCGAACA GAATGAAATG
GAGGTCGTGG ATAAGGACGG CAAGGTCATC GCCTCCGGCG CAATCTCAGA CGCAGTGCTC
AAGCACCTGA AGAACGGCTC GTATTCGATC CGTCAGCGTC CAGGCGCGGA CAATGCGCTC
GGCCTCGTGA AGATCATCTT TCCTAACTCG CATAACGTTT ATCTGCACGA CACGCCTGAG
TTCAAGACCA TGTTCTCGAA GGCACCGCGT GCATTGAGCC ACGGATGCAT CCACCTCGAA
AAGCCCGCCG ATCTCGCCTA CTGGCTCTTG CGCGACAAGA CCGATTGGTC GCTGGACAAA
GTGAAAGAAG CCATGCAGCA CGGACGCGAC AACTCCAGCG TGACCCTTAC TAAGCCCGTG
CCGATCCTCA TCCTCTACGT AACCGCCCGC GCCCAGACCA ATGGCACTGT CCAGTTCTTT
AAAGATATCT ACGGCCACGA CGTCGAACTC AAAGCTGCGC TGGCGAAGGG CTATCCGTAT
CCGTAG
 
Protein sequence
MPLRRRFTAF VYLLMFGLLL PHVAHAQKKP ASAKPAGSPA LQSAIRAGQL PDMRWPNFSD 
YRVQIDNFYK ASNYSLAWIE AGTPTDHARQ MIAILSAADS QGLNAEDYDG LRWPDRISKL
AAAHAPEDED VFDLALTVST MRYISDMHIG RINPTHFQFG LDVEHKKLDL PSFVRNMLSS
PDDLTHTIAK VGPPFAGYEA TRQAMLQYTQ LAKQPDTEKL PLPVGVVYQG GYYDHMPALA
KRLQQLGDLD PKVIILADAI KYDDPLMGGV AHFQSRHGLP NDGNLTSDTI DALNIPIADR
LEQLKLALER YRWIRYQFTS PPVVVNVPEF KLFGYDGSGT QILSMGVNVG DAFDFQTPIF
EGDIRYIVFR PYWYVTPTIQ RDEMVPSVEE DRTYLEQNEM EVVDKDGKVI ASGAISDAVL
KHLKNGSYSI RQRPGADNAL GLVKIIFPNS HNVYLHDTPE FKTMFSKAPR ALSHGCIHLE
KPADLAYWLL RDKTDWSLDK VKEAMQHGRD NSSVTLTKPV PILILYVTAR AQTNGTVQFF
KDIYGHDVEL KAALAKGYPY P
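
The gene sequence starts with TTG, while the protein sequence starts with M. This is consistent with translation table 11, in which TTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of that rule applied to the first ten codons of the gene sequence. The codon table is deliberately partial (only the codons in this excerpt), the start-codon set is a subset chosen for illustration, and the `translate` helper is hypothetical rather than taken from any library.

```python
# Partial codon table: only the codons that occur in the excerpt below.
# A complete genetic code table has 64 entries.
CODON_TO_AA = {
    "TTG": "L", "CCA": "P", "CTA": "L", "CGT": "R", "CGC": "R",
    "CGA": "R", "TTC": "F", "ACG": "T", "GCC": "A",
}

# Subset of initiation codons used for this illustration (assumption:
# as a start codon, each of these is translated as methionine).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, reading a start codon as M."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
        else:
            residues.append(CODON_TO_AA[codon])
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("TTGCCACTAC GTCGCCGATT CACGGCCTTC"))
```

The result can be compared with the first ten residues of the protein sequence listed above (MPLRRRFTAF).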