Gene Acid345_1532 details

Gene Information

Locus tag: Acid345_1532
Symbol:
ID: 4073020
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1869474
End bp: 1870784
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 59%
IMG OID: 637983541
Product: aminotransferase
Protein accession: YP_590608
Protein GI: 94968560
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0160] 4-aminobutyrate aminotransferase and related aminotransferases
TIGRFAM ID: [TIGR00700] 4-aminobutyrate aminotransferase, prokaryotic type
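
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which simple arithmetic can confirm. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1869474
end_bp = 1870784

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the coding sequence ends with a single stop codon,
# which is counted in the gene length but not in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1311 436
```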


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGATA CGATTCGGAA ATACCAGGAA TACGTCATTA CCAGCTTCGT GAAGGCCGTG 
CAGCCTGTCG TTATTGAAAG CGCCAGCGGC GCGATCATTA AGGACATCAG CGGGCGCGAG
TTCATCGACT GCTTTGCCGG CATTTCGGTG GTGAATGCCG GACATTGCAA TCCGAAGATC
AACGCCGCTG CCAAGGCGCA GATTGATAAG CTCGTGCATT GTGGCTCGTA CATCTATCAC
AGCCAACCGA CCGCGCAATT GGCGGAGAAG ATGGCGAAGA TCACGCCGGG GCGGTTGAAA
AAGTCCTTCT TCGCCAACAG CGGCGCCGAA GCGATCGAGG GCGCGATGAA AGTTGCGCGC
CTCTTCACCG GCAAGCACGA GATCATTTCG CTGCAGCAGT CCTTCCATGG ACGCACCTGG
GGCACGCTGA GCATCACTGG CAACCAAGGC CGCAAGAAGC GTGGCGGCCC GTATGCTCCG
GGCATCGCAT TCGCGCCGGC ACCGTATGCC TTCCGCTCGC CATGGCCTAA TGAGCCAGAG
AAGTTCGCTT CCTACTGCGC GAAACAAGTG GAAGAAACAA TCCGATACTC AACCTCCGGC
GATGTCGCGG CATTCATCGC CGAACCGGTG ATGGGTGAAG GCGGCATCAT CGTCCCGCCG
CAAAACTACT TCCGCGAGGT GAAGGAAGTC CTCGATCGCC ATGGGATTTT GTTCATTGCC
GACGAAGTAC AATCGGGCTT CGGCCGCACC GGTAAGATGT TTGCGATCGA ACACTACGAC
GTCGAACCTG ACATCCTCGT CACCGCCAAG GGCATCGCCA ATGGATACCC GATCGCAGCG
TTCACAACGC GTGATGAGAT TGCGGCTGCA TTCAAACCTG GCGACCACCT GTCGACTTTC
GGTGGAAATC CGATTTGTTG CGCGGCTGCG CTCGCCAACA TCGAATTCTT CGAAGAAGAA
AAGCTCTGTG ATCAGTCAAC CGAGAAGGGC CAACACGCGC TCACCCGCTT AAGAGCGCTG
CAGGGGCGGC AGTCGGGCAT TGGCGAGGTT CGCGGGCTCG GCCTCATGAT CGGCGTGGAA
CTGGTGAAAG ACGACCATCT CACACCTGCG GCCGCCGAAG CGGAAGCCGT GCGCGACACC
TGCTTCAAAG CCGGTGTGCT GATCGGAGTC GGCGGCACGA ATGCCAACGT TCTCCGGCTT
CAGCCGCCGC TCGTCATTAC CTACGAACAG CTCAACACCG CACTCGATGT GTTGGAGGGC
GCGATCACCG AAGTAGTGGG CCGCACCGCG GCCGTTGCGG GCAAGGCATA G
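
A GC content figure such as the one listed for this gene can be recomputed from the nucleotide sequence. The record does not say how its value was derived, so the sketch below assumes the usual definition (G plus C bases divided by all bases); `gc_percent` is a hypothetical helper name, and the demonstration uses only the first line of the sequence above.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)

# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGAGCGATA CGATTCGGAA ATACCAGGAA TACGTCATTA CCAGCTTCGT GAAGGCCGTG"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence (all lines joined) gives a value that can be compared with the GC content field.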
 
Protein sequence
MSDTIRKYQE YVITSFVKAV QPVVIESASG AIIKDISGRE FIDCFAGISV VNAGHCNPKI 
NAAAKAQIDK LVHCGSYIYH SQPTAQLAEK MAKITPGRLK KSFFANSGAE AIEGAMKVAR
LFTGKHEIIS LQQSFHGRTW GTLSITGNQG RKKRGGPYAP GIAFAPAPYA FRSPWPNEPE
KFASYCAKQV EETIRYSTSG DVAAFIAEPV MGEGGIIVPP QNYFREVKEV LDRHGILFIA
DEVQSGFGRT GKMFAIEHYD VEPDILVTAK GIANGYPIAA FTTRDEIAAA FKPGDHLSTF
GGNPICCAAA LANIEFFEEE KLCDQSTEKG QHALTRLRAL QGRQSGIGEV RGLGLMIGVE
LVKDDHLTPA AAEAEAVRDT CFKAGVLIGV GGTNANVLRL QPPLVITYEQ LNTALDVLEG
AITEVVGRTA AVAGKA
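
The protein sequence above corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal sketch of that translation, not the database's own pipeline; it assumes the sequence is given on the coding strand and starts with ATG, so the alternative start codons that table 11 allows need no special handling. `translate` and `CODON_TABLE` are hypothetical names.

```python
from itertools import product

# NCBI translation table 11 assigns the same amino acids as the standard
# code; the string lists them in NCBI's TCAG codon order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence above.
print(translate("ATGAGCGATA CGATTCGGAA ATACCAGGAA"))  # MSDTIRKYQE
```

The output matches the first ten residues of the protein sequence; translating the full gene sequence should reproduce all 436 residues.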