Gene Acid345_0114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0114 
Symbol 
ID4069489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp121595 
End bp122782 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content62% 
IMG OID637982114 
ProductL-aspartate aminotransferase / phosphoserine aminotransferase 
Protein accessionYP_589193 
Protein GI94967145 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0075] Serine-pyruvate aminotransferase/archaeal aspartate aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.31089 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.387069 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCATA AGAACCGGCT GTACACGCCG GGGCCGACCC AACTTCTGCC CTCGGCGCAG 
TTCGCCATGG CGGCGGCTAC CATGCATCAC CGTACCGCCG ACTTCCGTGA CCTCTACGTA
CGCACTCTTG CCGACCTGAA ATCGTTTATC GGGACCCAGA ATGACGTCAT TCTCATGGCC
TGCTCCGGTA GCGGCGCCAT GGAGGCCTCG GTTTCGAACC TCACTTCACC CGGCGACAAA
GTTCTCGTGC TCACTGCCGG AAAGTTCGGT GAACGCTGGA GCAGCCTCGC GAAAGCTTAC
GGCGTAAAAA CTGAAGTCGT AAGCGCGCCC TACGGTGAAA CCTGGACGCT GGATGCCGTC
AAAGAAAAGC TCGACGGCGT CAGCGTGGTT TACATGCAGG GCACCGAGAC CTCAACCGGC
GTACGCCATG ACGTCGAAGG CGTAGCCAAG CTGTTGAAGG GCACCGACAC GCTGCTCGTC
GTGGACGCCA TCACCGGCCT CGGCACCACC CACTTCGACG TGGATGGCTG GGGCGTGGAC
GTGATCATCG GTGGCTCGCA AAAAGCTGTC ATGATCCCGC CAGGATTGGC TTTCCTCTCT
GTCAGCGAGC GCGCTTGGAA GAAGATGGAC GCGACGAAGA ACCCGCGCTT CTACTTCGAC
TTGCGCAAAG AACGGAAGTC GGCGGCAAAG GGCGAATCGG CCTACACACC GGCGACGTCG
TTGATCGCGG GACTCGGCGC AGCTTTGGAC TTCATCCGCG GCATGGGCAA CGGTAATTTG
TCTGACGGTC GCCATGCGTT GGTGGAGAAC GCCGAAACCG CGGCGGCCAT GACTCGCGCC
GCAGCCGAAG CTCTCGGGCT CACGATGTTT GCGAAGCACT CGCCGGCAGC GGCAGTGTCG
GCCATCAACG CCCCCGCGGG CGTGGATTCC GGCGCCATCG TGAAGGGTTT CCGCAACCAG
TTCGGCGCGG TGGTCGCAAA CGGTCAGGGT GACGAGATGA AGGGCAAAAT CTTCCGCGTC
GCGCACCTCG GCTATTACGA CTACCTCGAC ACCATCGCAA TCGTGGGCGC ACTCGAGCAT
GTGCTGGCTT CGATCTCGAA GCCCGCGCAC GTTGAGTTTG GACCCGGCCT GCGTGCCGCG
CAGTTCGTTT ACGCACAGCG CGCAACGTTT GCCGCTGCTA CGGTGTAA
 
Protein sequence
MLHKNRLYTP GPTQLLPSAQ FAMAAATMHH RTADFRDLYV RTLADLKSFI GTQNDVILMA 
CSGSGAMEAS VSNLTSPGDK VLVLTAGKFG ERWSSLAKAY GVKTEVVSAP YGETWTLDAV
KEKLDGVSVV YMQGTETSTG VRHDVEGVAK LLKGTDTLLV VDAITGLGTT HFDVDGWGVD
VIIGGSQKAV MIPPGLAFLS VSERAWKKMD ATKNPRFYFD LRKERKSAAK GESAYTPATS
LIAGLGAALD FIRGMGNGNL SDGRHALVEN AETAAAMTRA AAEALGLTMF AKHSPAAAVS
AINAPAGVDS GAIVKGFRNQ FGAVVANGQG DEMKGKIFRV AHLGYYDYLD TIAIVGALEH
VLASISKPAH VEFGPGLRAA QFVYAQRATF AAATV