Gene Acid345_0568 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0568 
Symbol 
ID4073057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp694732 
End bp695931 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content61% 
IMG OID637982573 
ProductL-aspartate aminotransferase 
Protein accessionYP_589647 
Protein GI94967599 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0641673 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCTG CAACATCGCA ACTGACGCTC AGCCCCGCCG CGCGGATGAA CCGCATTGAA 
ATTTCGGCAA CCCTCGCGGT TGTGAACGAA GCCGAGAAAC TGCGCTCTGC CGGGGTGGAT
CTGGTGGACT TTGGCGCGGG CGAGCCGCAC TTCGGAACGC CGCAGCACAT CCGGGAAGCG
GCGATTGCGG CGATCCATAA CAACTTCTCG AAATACACGG CAGTGGCGGG CACGGCGGAA
CTGCGCGATG CGATTGCGAA GCGGCACGCC ACAGACTTCG CCACCGACTA CAAGCGCGAG
GAAGTGATCG CTTCCGTGGG CGGCAAGCAT GCGCTGTTCA ACGCGATCCA GGTGCTGGTG
GACCACGGCG ATGAAGTCAT CATCCCGGTG CCGTACTGGG TGTCGTTCAA AGACATGGTG
CAGTACTCGG GCGGCAAGCC GGTGTTTGTA GAAGCGGATG AGAGCCAGAA CTTCCGGCTG
ACGGCGGCGA TGGTCGAGAA GGCCGTGACG CCGAAGACGA AGCTGATCAT TTTAAATTCG
CCGTCGAACC CGTCGGGCGC AGTGATGGCG CCGGAGGACA TGAAGTCGAT AGCGCGCTTT
GCCTATGAAC GCGGGATTTG GGTCATCTCC GATGAGTGCT ATGTGTATCT GAACTACACC
GGCGAAGAGT TTTCGCTGGG CAGCCTGACC GAAGTGAAGG AGCGGCTGCT GGTGGTGGGA
TCGCTTTCGA AGACCTACGC CATGACCGGA TGGCGGCTGG GCTACACGCT GGCGCCGGCG
GCGGTGGTGA GCCAGATGCA GAAGCTGCAA AGCCAATCGA CGTCGAACCC GACCTCAATT
GTGCAGAAGG CAGCGGTGGC GGCGTTGAAT GGTCCGCAGG AGTGCGTCGC CGAGATGCGC
GCCGACTACA TTAAGCTGCG CGACGAGATC GTGAGTGGGC TGCGCTCGAT TCCGGGCGTG
AAGTGCACCA TGCCACAGGG CGCGTTCTAC GCCTATCCGG ACATCAGCTG CGCGTTTGGC
AAGGCAGGGA TGAACTCGGC GGCCGACGTC GCGAAGAAGC TACTGCACGA GGCACACGTG
GTCTCGGTCC CGGGCGAGGC GTTCGGCACA ACCAAACACA TCCGGCTGTC GTACGCGGCT
TCGCATGAGA ATGTGGCGCG CGGTTTGGAG CGGATGCACA AGTTCTTCGC CAGCCTTTAA
 
Protein sequence
MSSATSQLTL SPAARMNRIE ISATLAVVNE AEKLRSAGVD LVDFGAGEPH FGTPQHIREA 
AIAAIHNNFS KYTAVAGTAE LRDAIAKRHA TDFATDYKRE EVIASVGGKH ALFNAIQVLV
DHGDEVIIPV PYWVSFKDMV QYSGGKPVFV EADESQNFRL TAAMVEKAVT PKTKLIILNS
PSNPSGAVMA PEDMKSIARF AYERGIWVIS DECYVYLNYT GEEFSLGSLT EVKERLLVVG
SLSKTYAMTG WRLGYTLAPA AVVSQMQKLQ SQSTSNPTSI VQKAAVAALN GPQECVAEMR
ADYIKLRDEI VSGLRSIPGV KCTMPQGAFY AYPDISCAFG KAGMNSAADV AKKLLHEAHV
VSVPGEAFGT TKHIRLSYAA SHENVARGLE RMHKFFASL