Gene Acid345_1537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1537 
Symbol 
ID4072928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1876675 
End bp1878045 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content59% 
IMG OID637983546 
Productglutamate-1-semialdehyde 2,1-aminomutase 
Protein accessionYP_590613 
Protein GI94968565 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0001] Glutamate-1-semialdehyde aminotransferase 
TIGRFAM ID[TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.033412 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.385317 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGACG CATTGCGGCA GGAAATCGAA CTTTTTGAAA AGCGCACACC CAAATCAGCC 
GAAGCCCACA AGCGCAACCT GAAGCGTTTA CCGCTCGGCG TTGCAAGTAA TTATCGCGCT
TACGATCCGT ATCCAATTTT TGTTAAGGAC GCTTTCGGCT CGAAGTTCCG CGACCTCGAC
GGGAACGAAT ACATTGACCA CAACCTCACG TTTGGCGCGT TAATGGCGGG CCATTGCCAC
CCGGCAGTAA TGAAGGCGGT GGAGAAACGC CTGACCACCG GCACGATGTT TGGCATGCCG
CACGACATGG AGTGGGAACT CGCGGAAGAG ATCTGCGCCC GCTTCCCTAT CGAAATGTTG
CGCTTTGCCT CGACTGGGAC CGAGGCCACA ATGCACACCG TGCGTCTTTG CCGCGCGGCG
ACCGGTCGCG ACAAGATCAT CAAGTTCGAA GGCGGTTACC ACGGATTGCA TGACGCTGCA
CTAGTCAGTG TGAAACCGAA GCTGGAGCAG ATCGGCGATC TCAAAGCTCC GATTGCCGTT
CCCGGTGGAC AGGGTGTCCC GAAGACGGCA GTGGCCAATG TGCTGATTGC CAGCTTCAAC
GACCTTGAGA GCGTGGAACA TCGCTTCAAG ACCCATCCTA ACGAGATCTC TGCGATCATC
CTCGAACCGG TGATGATGAA CGTCGGCATC TGCATGCCGG AACCGGGCTT CCTCGAAGGA
CTGCGCGAAT TGTGCGACAA GCACGGTGCG CTGCTGATCT TCGACGAAGT GAAGACCGGC
GCCAAGCTGG GCTGGGGTGG AGCGTCCGAA TACTTCGGCG TGATACCCGA CGCCATCTGC
CTGGCAAAGT CGATCGGCGG CGGCCTGCCG CTCGCTGCGT TCGGCGCCTC GAAGAAAGTG
ATGGGACTCA TCTCCGATCA CAAGGTCTTC CATGGCGGCA CCTACAACAC CAACCCCGTA
TCGATGGCCG CTGGCTTAGC GACCTTCCGC GAAGTCCTGA CTCGCGAGAA CTACGCGCAT
GTGGAGAAGC TCAGCCACAA GCTCGTGACT GGCTATCGCA GAGTCGTCGA AGAAGTCGGG
CTGGACGCTT ACCTGGAAAT TGCCGGCGCA AACGGCGTAC TGATGTTCGC TCCGAAGCGC
GTGCGTAACT ATCGCGATTG GCTCGAAGTC GACGCGAGTC TCTGGCAGCA GTATTGGTTC
GCCATGGTCA ATCGCGGCGT AATGCCACAG CCTTACTGGT GGGACGAGCA GTGGACCATG
TCCGTCGCAC ACACCGATGC GGATACCGAG AAGCATCTCG CGGTGTTTGG CGAGATCGCT
CCCGCCTTGG CGGTCGCGCA GAAGGAACCG CGCGAGGCAG TGGTGCACTA A
 
Protein sequence
MHDALRQEIE LFEKRTPKSA EAHKRNLKRL PLGVASNYRA YDPYPIFVKD AFGSKFRDLD 
GNEYIDHNLT FGALMAGHCH PAVMKAVEKR LTTGTMFGMP HDMEWELAEE ICARFPIEML
RFASTGTEAT MHTVRLCRAA TGRDKIIKFE GGYHGLHDAA LVSVKPKLEQ IGDLKAPIAV
PGGQGVPKTA VANVLIASFN DLESVEHRFK THPNEISAII LEPVMMNVGI CMPEPGFLEG
LRELCDKHGA LLIFDEVKTG AKLGWGGASE YFGVIPDAIC LAKSIGGGLP LAAFGASKKV
MGLISDHKVF HGGTYNTNPV SMAAGLATFR EVLTRENYAH VEKLSHKLVT GYRRVVEEVG
LDAYLEIAGA NGVLMFAPKR VRNYRDWLEV DASLWQQYWF AMVNRGVMPQ PYWWDEQWTM
SVAHTDADTE KHLAVFGEIA PALAVAQKEP REAVVH