Gene Acid345_1534 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1534 
Symbol 
ID4072925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1872299 
End bp1873660 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content59% 
IMG OID637983543 
Product4-aminobutyrate aminotransferase 
Protein accessionYP_590610 
Protein GI94968562 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0160] 4-aminobutyrate aminotransferase and related aminotransferases 
TIGRFAM ID[TIGR00700] 4-aminobutyrate aminotransferase, prokaryotic type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.46092 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.55252 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTCAA TTGTTTTGCG AACCAAGGTT CCGGGACCGA AAGCGCTGGA GCTTGCGAGC 
CGCCGATCGG CCGCGGTTCC TCGCGGTATC TACGCATCGA CGCCCATCTA TGTGTCGCGT
GCGGAAGGCG CGCTGATCGA GGATGTGGAT GGCAATACCT TCATTGACCT CGCCGGCGGC
ATCGGCGTAA TCAATGTTGG GCATCGCTCA CCTGCCGTCG TCGAAGCCAT TCATCGTCAG
ACCGACCGCT TTCTCCATAC CTGCTTTCAG GTTGTCGGAT ACGAGAGCTA CATCCGTCTT
GCGGAGAAAT TGAATGAGAT CACCCCCGGC GAATTTCCAA AGCGTACGTT CTTCGTGAAC
TCCGGCGCGG AAGCCGTCGA GAACGCGGTG AAGATCGCGC GCTATCACAC CAAACGTCCC
GCGGTTATCT GCTTTGAAGA TGCGTTCCAC GGGCGGACCA CGCTGGGGAT GGCCCTCACC
AGCAAGACGC ATCCGTACAA AGCTGGGTTC GAGCCATTCC CTAGCGAGAT TTATCGCATC
CCCTACGCGT ACTGCTATCG CTGCTCCTAT GGGAAGAAGT ATCCGAGTTG CGAAGTTGCG
TGTGCCGATG CTTTAGAAGG CGTCTTCAAG CGCACTGTGG CAGCAGAATC CGTGGCGGCG
ATCATTATTG AACCGGTGCT CGGCGAAGGT GGGTTCGTGA CCCCGCCCAG CGATTTCTTG
CGAAAGCTGC ACGGCATCTG CAAGCAGCAC GGCATCGTCT TCATCGCCGA CGAAGTACAG
ACCGGCTTCG GCCGCACCGG CGCGATGTTT GCCTGCGAGC GCTACGGCGT TGAACCGGAC
ATTTTGATCG GCGCGAAATC TCTCGGCGGT GGTTTGCCAA TTGGATCCAT CACGGGCCGC
GCAGAAATCA TGGATGCGCC CATACCGGGC GGCATCGGCG GAACGTTCGG AGGCAGTCCT
CTTGCGTGCG AAGCCGCGCT GGCAACGATT GAAGCCATGC AGCGCCAGGA TCTTCCGGCG
CGCGCTAACG CACTGGGTGA ACGCTTCCGG GCCCGTGCCC TGCGGTGGCA AGCGCAGTGG
CCGCAGATTG GCGAAGTGCG CGGCCTTGGC GGGATGCAGG CGATCGAACT GGTGCGCTCG
GCCGAGTCAC GAACTCCCAA CGACTCCGCG ACGAAGCACA TCATCCAATA TTGTTATGAG
CGCGGCGTGA TTACTCTCAA CGCGGGCACG TATAGCAACG TCATTCGCAT TCTCATGCCG
CTCGTCATTT CCGATGCGCA ATTCGAAGAA GCGCTCGACG TAATGGAATC GGCCCTCTCG
CACGAGTTTG CGACTTCTCG TGTGACTACC GGAGTTTCAT AG
 
Protein sequence
MASIVLRTKV PGPKALELAS RRSAAVPRGI YASTPIYVSR AEGALIEDVD GNTFIDLAGG 
IGVINVGHRS PAVVEAIHRQ TDRFLHTCFQ VVGYESYIRL AEKLNEITPG EFPKRTFFVN
SGAEAVENAV KIARYHTKRP AVICFEDAFH GRTTLGMALT SKTHPYKAGF EPFPSEIYRI
PYAYCYRCSY GKKYPSCEVA CADALEGVFK RTVAAESVAA IIIEPVLGEG GFVTPPSDFL
RKLHGICKQH GIVFIADEVQ TGFGRTGAMF ACERYGVEPD ILIGAKSLGG GLPIGSITGR
AEIMDAPIPG GIGGTFGGSP LACEAALATI EAMQRQDLPA RANALGERFR ARALRWQAQW
PQIGEVRGLG GMQAIELVRS AESRTPNDSA TKHIIQYCYE RGVITLNAGT YSNVIRILMP
LVISDAQFEE ALDVMESALS HEFATSRVTT GVS