Gene Acid345_1036 details

Gene Information

Locus tag: Acid345_1036
Symbol:
ID: 4073123
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1301347
End bp: 1302897
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 58%
IMG OID: 637983043
Product: hypothetical protein
Protein accession: YP_590113
Protein GI: 94968065
COG category:
COG ID:
TIGRFAM ID:
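
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in the protein length. Neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one trailing stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1301347, 1302897)
print(length)                  # 1551
print(protein_length(length))  # 516
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.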


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.977516
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCGTTTTC AGAAGCCCCT GTTTCTGCTC GCAATTCTCT CTTTATCTCT TTTCGCGTCA 
GCAGCCCAGA AGGGGCATGA CGCAACGCTT ACGGTTGGCG TCCAGGGTGG TTCAGGCGTG
GCCGCTTCGG ACATTTCCGT GACCCTCGGG GGGAAAGGCG CCGCCCTTGT AGCGGCTCCG
GTACCGAAGA GCCTGATAGT TCCGGATCTG CAACTGGATC AATACACTAC AGAGCTGCCG
GAGGCGGTCC AGAGCCCGAT TATTTTCGTC TTCGATGCGA TGGATACGAG AGAGATCGAA
GAACGCGACA TGCGGAAGCA TGTGCTGAAA TTCATGGCCG ATGCTGCGGC GAAGAAGCAG
GCGGTGGGGC TGGTGATGAT GTATCGCGAC GGCGTTTTCG CGATCCATGA CTATCGCATG
GGATCGGCGG TGCTGGCGGC GGCCCTGGCG AAGGCCAATG GTGGATCCGC TCCGAGTGTT
CCCGGTGCCG AGCAACGAGT TGCAATTGAA GCGAAGCTGC TGGCAGAGTT CGCTAAAGGC
ACATATTCAA CCCAGGCGAG CGACAACACC GTACTCTTGA CGACGCTGGA TGCGCCGGTC
CTGATGATGC AGGAAGTGGG AGCAGCACTG CGCGACCTTC CGGGACGCAA GGCGATTGTA
TGGGTCACGG CAGGTGTGCC GTTCGAGATC GAAGAACGCG ATCACAGCCT CACGACCCAC
CTCGAACTGA ATAGCGGCGT GGCTGTGAAC GGCGCTTCGG TGACGAGCAC AAAGCGCACG
GCAACCGACG ACCAGATCCG AAAACTACAA CCGATGTGGA GAGCGACGCT CAACAACCTT
TGGGATTCCG GTACCGCGGT TTACCCGGTT GAAGCTCGCA GCAATTTTTC TCCACCGGCG
GGGCGCGTTT ATACGTCCAC GATGACCCAG GTTGCCGAAA TGACGGGCGG CAAGGCTTTC
TACGGGAGCA ACGATCCTTC CTCCTTTTTC AACTCGATCG TGAGCGACAA TGCGAATTCG
ATCCGGGTCG CATTTCCAAT CGAAGGTTCG AACGACAATT GGCAGAAGTT GAATGTGACT
TCGCCGAAGG GAAAAGTCTT TGCGCCCACC GGGATGTTCG TACCGCCGGA CCGCAGCGCC
GACGATCTTC GCAAGAACGC GATCAGCACA GCACTGAATT CACGCTTTGG TTTCGGGGGA
ATGCCGTTCC GATTGACGCT AGGCGACCAG GGGGCGAGCG GGGCCAAGAA GACGGTGAAC
TTCGTCGTAT TCCTGCCTCC GAATGCGGGC TTTGTGGATG AGAAAACCGG CGAGATCAAC
CTGGACATCG CAGCCGTTGC GTATGGCAAG AAAAGCGAGA AGGCCGGGAC GATGGCGGCA
GCGGTAGCGA CGAAAGTGCC GACCGAAGCG GTGAAACAGA TCGGCGAGAT GGGGTCCAAA
CTCTCGCAGA AAATTGATCT GCCGCCCGGG GATTACACGT TGCGCGTTGT GGTTCGCGAC
AATCTGAATG GACGTGTCGG CTCGATTGAG GCTCGCGTAG AAGTAAAGTA A
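
A GC content figure such as the one in the record can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases and that the spaces and line breaks of the 10-base layout are ignored. The helper name `gc_content` is hypothetical, and the database may round or compute the value differently.

```python
def gc_content(seq: str) -> float:
    """Percentage of G/C bases; whitespace in the sequence is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base groups of the gene sequence above.
print(gc_content("TTGCGTTTTC AGAAGCCCCT"))  # 50.0
```

Passing the full gene sequence to the same function gives the value for the whole gene.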
 
Protein sequence
MRFQKPLFLL AILSLSLFAS AAQKGHDATL TVGVQGGSGV AASDISVTLG GKGAALVAAP 
VPKSLIVPDL QLDQYTTELP EAVQSPIIFV FDAMDTREIE ERDMRKHVLK FMADAAAKKQ
AVGLVMMYRD GVFAIHDYRM GSAVLAAALA KANGGSAPSV PGAEQRVAIE AKLLAEFAKG
TYSTQASDNT VLLTTLDAPV LMMQEVGAAL RDLPGRKAIV WVTAGVPFEI EERDHSLTTH
LELNSGVAVN GASVTSTKRT ATDDQIRKLQ PMWRATLNNL WDSGTAVYPV EARSNFSPPA
GRVYTSTMTQ VAEMTGGKAF YGSNDPSSFF NSIVSDNANS IRVAFPIEGS NDNWQKLNVT
SPKGKVFAPT GMFVPPDRSA DDLRKNAIST ALNSRFGFGG MPFRLTLGDQ GASGAKKTVN
FVVFLPPNAG FVDEKTGEIN LDIAAVAYGK KSEKAGTMAA AVATKVPTEA VKQIGEMGSK
LSQKIDLPPG DYTLRVVVRD NLNGRVGSIE ARVEVK
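
The protein sequence starts with M although the gene sequence starts with TTG. Translation table 11 uses the standard codon assignments but allows alternative start codons, including TTG, which are read as methionine at the initiation position. The following is a minimal, dependency-free sketch of that rule; the `translate` helper is illustrative and is not the tool the database used.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments (shared by table 11), in TCAG order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating start codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "TTGCGTTTTC AGAAGCCCCT GTTTCTGCTC GCAATTCTCT CTTTATCTCT TTTCGCGTCA"
print(translate(first_line))  # MRFQKPLFLLAILSLSLFAS
```

The output matches the first 20 residues of the protein sequence listed above.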