Gene Acid345_3040 details


Gene Information

Locus tag: Acid345_3040
Symbol:
ID: 4071947
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3608644
End bp: 3610089
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 60%
IMG OID: 637985059
Product: D-glutamate deacylase
Protein accession: YP_592115
Protein GI: 94970067
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID:
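
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the unspliced coding sequence ends in a single stop codon (as the gene sequence in this record does); the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3608644
end_bp = 3610089

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1446 bp
print(protein_length, "aa")  # 481 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.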


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.236527
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCGTCG CATGCGCCCT TTTTGCTGTC GTCGTCCACT TCCATTCACT TCGCGCCAAT 
GCGAGCGGGC GGCCGTATGA CGTCGTGATC TTGAACGGGC GCGTCATGGA TCCCGAGTCG
GGGCTCGATG CGATGCGGAA TGTCGGAATT CGCCACGGGA AGATCGTTGC AGTTTCGACT
GCGGCGATTA CCGGGAAGCG CACGATTGAC GCTAAAGGGT TGGTGGTGGC ACCCGGCTTC
ATCGACATGC ATGAGCATGG GCAGGAGCCG CGCAATTACC AGTTCCAGGC CCACGATGGA
GTAACGACTT CGCTCGAGTT GGAAGTTGGG ACCGACGATG TTGCGCAGTG GTACGCGAAT
CGCGAAGGGA AGGCGCTGAT CAATTATGGC GTGAGCATTG GGCATATCCC GGTGCGAATG
AAGGTGCTGA AAGATCCCGG AAAATGGCTA CCTACAGGTG ACGCGGAGTA CCGTGCTTCT
ACTTCGGAAG AGCTTGCCGA AATCCAACAG AGGATTCAGG CAGGGTTGGA CGCGGGAGCG
CTCGCAGTGG GAATGGGGAT TAACTACACG GCGGCTGCGT CACACGAGGA GATCGTGGAC
ATGTTTCGGA TTGCTGCGAA GAACGGCGCG CCGGTGCATG TGCATTTGCG GTGGGCGGGA
ATCAAAGAGC CGGAGACCGG ACTGGCTGGG CTGGAAGAAG TGATTGCGGC GGCGGAGTCT
ACGGGTGCGC CTCTGCACGT GGTGCATGTC ACCAGCATGG GGCTGCGCGA CACACCACAG
TTGATTGCGA TGATCGAAGG TGCGCAGAAG CGTGGGTTGG ATGTGACCAC GGAGTGCTAT
CCGTACATTG CCGCAAGTAC AGGGCTGGAG AGTGCGATAT TTGAGCCCGG ATGGCAGGAG
AAGATGGGGA TCACGTACAA GGACCTGCAA TGGGTGGGCA CAGGCGAGCG ACTGACGCAG
GAGACATTCG CGAAATATCG GAAGCAAGGC GGGCCGGCGG TGATCTTCTC GATTCCAGAA
GCGGCGGCGA GAACCGCGGT CGCGAATCCG ATGGTGATGA TCGCGAGCGA TGGGCCGCAG
TTCACTGGGC CGAAGGTGCA TCCGCGCGGG AACGGGACGT TTTCACGTGT GCTGGGACAC
TACGTGCGCG AGGAACATGC GCTCGATTTG ATGACCGCGC TGAGAAAAAT GACCTTGATG
CCGGCGCAAC GGTTGGAGAA ACGGACGCCG GAATTCAAGA ACAAAGGCCG CATTCGCGTA
GGCGCTGATG CCGACATCAC CGTGTTCGAT CCGCAACGCG TGATTGATAA AGCGACGTTT
GAAGAGCCGA TGCAGTATTC CGCGGGGATT CAGTTCGTGC TGGTGAATGG AGTGCCCGTG
GTGAGTGACG GCAACCTCGC GGAGGGAGTC TTCCCGGGAC GTGCGGCGCG CGCGCCCGTG
CACTAG
 
Protein sequence
MIVACALFAV VVHFHSLRAN ASGRPYDVVI LNGRVMDPES GLDAMRNVGI RHGKIVAVST 
AAITGKRTID AKGLVVAPGF IDMHEHGQEP RNYQFQAHDG VTTSLELEVG TDDVAQWYAN
REGKALINYG VSIGHIPVRM KVLKDPGKWL PTGDAEYRAS TSEELAEIQQ RIQAGLDAGA
LAVGMGINYT AAASHEEIVD MFRIAAKNGA PVHVHLRWAG IKEPETGLAG LEEVIAAAES
TGAPLHVVHV TSMGLRDTPQ LIAMIEGAQK RGLDVTTECY PYIAASTGLE SAIFEPGWQE
KMGITYKDLQ WVGTGERLTQ ETFAKYRKQG GPAVIFSIPE AAARTAVANP MVMIASDGPQ
FTGPKVHPRG NGTFSRVLGH YVREEHALDL MTALRKMTLM PAQRLEKRTP EFKNKGRIRV
GADADITVFD PQRVIDKATF EEPMQYSAGI QFVLVNGVPV VSDGNLAEGV FPGRAARAPV
H
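
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation, not the tool used to produce this record. It assumes the amino acid assignments of table 11 match the standard genetic code and that an alternative start codon such as GTG is read as methionine at the first position; the function name `translate` and the constants are illustrative only.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons of translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiating codon is translated as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from this record.
first_row = "GTGATCGTCG CATGCGCCCT TTTTGCTGTC GTCGTCCACT TCCATTCACT TCGCGCCAAT"
print(translate(first_row))  # MIVACALFAVVVHFHSLRAN
```

The output matches the first 20 residues of the protein sequence above. Passing the full gene sequence to the same function should reproduce the whole protein under the stated assumptions.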