Gene Acid345_1480 details

Gene Information

Locus tag: Acid345_1480
Symbol:
ID: 4071650
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1790213
End bp: 1791439
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 59%
IMG OID: 637983489
Product: L-threonine synthase
Protein accession: YP_590556
Protein GI: 94968508
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0498] Threonine synthase
TIGRFAM ID: [TIGR00260] threonine synthase
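
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the gene record above.
START_BP = 1790213
END_BP = 1791439
GENE_LENGTH_BP = 1227
PROTEIN_LENGTH_AA = 408


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 1791439 - 1790213 + 1 gives 1227 bp, and 1227 / 3 gives 409 codons, one of which is the stop codon, leaving 408 residues.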


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATTTGC GATGCAACAA CCGGAATTGT GCGCACACGC TGGATCTACA CGAACGCGCG 
GTGGCGTGCC CGAAGTGTGG TGACCTGCTG GAAGTGGTGA TGGAAGCGCC AAAGCAGGAT
CCGGCGCAGG TAAAGCGAAT GTGGCTGGAG CGTCGAATGT CGAACGCCAG CGCCGACCGA
AGCGGCGTGT GGCGTTTCCG AGACCTCCTG CCCGGCCCCT ACTCGTTAGA AGATCTTGTA
ACCCTCTCCG AAGGAAATAC GCCACTGGTA CACGGCTTGA AGACCGGCAA ATCCACCGGT
CTCGACCAGC TGTTTTTCAA GCATCTCGGA TGGAATCCTA CTGGCTGCTT TAAAGACCTG
GGTATGACCA CGGGCATGAC GGAAGCCAAA CATGTCGGCG CGAAGATCGT CGCTTGTGCC
TCGACGGGCA ATACGTCGGC ATCGCTGGCG GCGTATGCGT CGCGCGCGGG ATTGGAAGCC
CACGTCTATC TGCCGAGTGG AAAGATTTCG CTGAACAAGC TGGCTCAGGC GCTGGAGTTT
GGCGCAAAGA TCGTCGAAGT GGATGGCAGC TTCGACGCGG CACTCGATCA GCTTCTTAAC
ACGAAAAACG ACGACATCTA CTTTCTGAAT TCTGTGAATC CGTTCCGCGT TGAGGGGCAG
AAGACGGTCA TCTTCGAGAT GATGGAGCAA CTCGACTGGC GGGTACCGGA CGTGGTGATT
TGTCCCGGAG GGAATCTCGG TAACAGCGCG GCATTCAGCA AAGGGCTGGA GGAGCTGAAG
GAGTTTGGAT TTATCGATCG GTTGCCAAAG CTCGTAGTGG TGCAGGCGGC GGGTGCGAAT
CCATTTGCTG AGCTGTGGCG GACTGGCGCG GACGAACTGA CTCCGACGGA ACATCCGGAG
ACGGTGGCGA CAGCGATTCG AATTGGCAAT CCGCGGTCGT GGCGCAAGGC GCTGCATGGT
GTGAAGATGA CCGGCGGGTT CGTGATGGAA GTAACCGATG AAGAGATCGG CGAAGCAAAG
GCGCTCATAG GCCTCGATGG CATTGGCTGC GAGCCGGCTT CGGCCACTAC GCTCGCCGGG
CTGCGCAAGC TGCATGCGGA AGGCAAGCTC GATCGCGATG CCACGGTGGT CGCGATCCTG
ACGGGCCACG CGCTGAAGGA CACCGACTAC ATCCTGAAGG CCCATGCGAA GGCGAATGAA
GCGCGACTGG CCGAGGTGAA CGGATGA
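
The GC content reported in the gene information can be recomputed from the gene sequence above. This is a minimal sketch: `gc_content` is a hypothetical helper, not part of any database tooling, and it simply ignores the spaces and line breaks used to group the sequence into blocks of ten.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks) is stripped first.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First block of the gene sequence: 4 of its 10 bases are G or C.
first_block = "ATGCATTTGC"
print(gc_content(first_block))
```

Passing the full 1227 bp sequence as one multi-line string should give a value that rounds to the figure listed in the record, assuming that figure was computed over this CDS alone.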
 
Protein sequence
MHLRCNNRNC AHTLDLHERA VACPKCGDLL EVVMEAPKQD PAQVKRMWLE RRMSNASADR 
SGVWRFRDLL PGPYSLEDLV TLSEGNTPLV HGLKTGKSTG LDQLFFKHLG WNPTGCFKDL
GMTTGMTEAK HVGAKIVACA STGNTSASLA AYASRAGLEA HVYLPSGKIS LNKLAQALEF
GAKIVEVDGS FDAALDQLLN TKNDDIYFLN SVNPFRVEGQ KTVIFEMMEQ LDWRVPDVVI
CPGGNLGNSA AFSKGLEELK EFGFIDRLPK LVVVQAAGAN PFAELWRTGA DELTPTEHPE
TVATAIRIGN PRSWRKALHG VKMTGGFVME VTDEEIGEAK ALIGLDGIGC EPASATTLAG
LRKLHAEGKL DRDATVVAIL TGHALKDTDY ILKAHAKANE ARLAEVNG
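
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the NCBI amino-acid string for translation table 11, which assigns the same amino acids as the standard code and differs only in permitted start codons. The `translate` helper is hypothetical; it assumes the input is already in frame and stops at the first stop codon.

```python
from itertools import product

# NCBI amino-acid string for translation table 11, with codons
# enumerated in T, C, A, G order at each of the three positions.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGCATTTGC GATGCAACAA CCGGAATTGT"))  # MHLRCNNRNC

# The final line of the gene sequence is in frame and ends in TGA.
print(translate("GCGCGACTGG CCGAGGTGAA CGGATGA"))  # ARLAEVNG
```

Both outputs match the start and end of the protein sequence listed above.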