Gene Acid345_2062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2062 
Symbol 
ID4070604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2472620 
End bp2473756 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content54% 
IMG OID637984076 
Producthypothetical protein 
Protein accessionYP_591137 
Protein GI94969089 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0964695 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.390374 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGAA TACCAGGCAC CCCGCTCGAA ACTCTCGAAC TGCCTGAAGA TCCGCCCAAA 
CAGACAGACC CGTGCTGGTG CCGGTCAGGC ATCGAATTTG GAAAATGCCA TTTCGAACGG
CATCTACAAC CGCGAGAATC CCCCTGGAGT GCTCTCAAAG AAGCGTCCAG GCTGAATGAC
GCGAAATACT GTGGGCATCC ACTGGCCTCA CCGGTCACAT GCAGCGGCAA AATCGTTCGC
GCGCACACGG TTCAACTGGA AGGTGCCTTG AGCACCATCG CCGTCGATCG CCACGTTTAT
GGGCTTGCAA TGAAGGACGG AAGGTTGGAG TACGGACTCA TCGGTCTACG AAAGGCGTCA
ACGTTTTCGG GGTTTTGCTC TTATCACGAT GCCGAGCTCT TTCGCGCGCT GGAAACGAAA
CCCTTCACCG CAACTAAAGA ACAGTTGTTT CTGCTCGCCT ATCGCGCACT TTCAAAAGAG
GTCTACGCGA AGCGATACGC TATCCGCACC ATTCCCCTTC AACGCAGGCA GGACAAGGGC
TCCGACGCCT TGCACCAGGT GAACGTTCAG AGTTACCTCT ACCTCCGGGA GCAAGCGTTG
AGGCTCGGAT TGCGCGATCT GGAGTCCGCA ATAGCAGATT ATGACAAAGC TCTCTTGGCA
AGGGACCATG ACCGGTTTTC TGGTTATTTG GTATTTACCG ACAAAACCCC GGATCTCGCG
GTCAGTGCAG CGATGTTTCC GGAGTTCGAC TTTCAGGCAA ACGCGCTCCA ATCGCTGTCT
AGTGCTGAAT GTCTTGACCT CCTGACCTAC ACAGTGTTGC CAATGTCGTC TGGGGGTGTC
ATCGCTTTTG TGTGGGATTC GAAAAGCGCC AGGTCGTGCG AAAAGCTCGT TGCTAGCCTT
GATCGCCTCC CTATGCGAGA GCTGCCGGAT GCACTGATTC GATTCACCTA CGAGTACTTC
GAAAATTGCT TTGCGAATCC TCTTTGGTGG GATTCGCTTT CCGGGGTGCA GAGAGAACGC
TTGCTCGCGA GGATTAATCT AGCGGTCTCT CCGACAGATG ACCGTACACC AGACTGTCTG
AAAGATGACG GACTGCGGAC GGCAAGGTGG AACGTAATTG CCAAGGAGTG GTTTTAG
 
Protein sequence
MARIPGTPLE TLELPEDPPK QTDPCWCRSG IEFGKCHFER HLQPRESPWS ALKEASRLND 
AKYCGHPLAS PVTCSGKIVR AHTVQLEGAL STIAVDRHVY GLAMKDGRLE YGLIGLRKAS
TFSGFCSYHD AELFRALETK PFTATKEQLF LLAYRALSKE VYAKRYAIRT IPLQRRQDKG
SDALHQVNVQ SYLYLREQAL RLGLRDLESA IADYDKALLA RDHDRFSGYL VFTDKTPDLA
VSAAMFPEFD FQANALQSLS SAECLDLLTY TVLPMSSGGV IAFVWDSKSA RSCEKLVASL
DRLPMRELPD ALIRFTYEYF ENCFANPLWW DSLSGVQRER LLARINLAVS PTDDRTPDCL
KDDGLRTARW NVIAKEWF