Gene Acid345_2764 details

Gene Information

Locus tag: Acid345_2764
Symbol:
ID: 4072386
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3272905
End bp: 3274458
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 60%
IMG OID: 637984781
Product: 4-alpha-glucanotransferase
Protein accession: YP_591839
Protein GI: 94969791
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1640] 4-alpha-glucanotransferase
TIGRFAM ID: [TIGR00217] 4-alpha-glucanotransferase
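
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that agrees with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record above; the function names are hypothetical.
START_BP = 3272905
END_BP = 3274458


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1554
print(protein_length(length_bp))  # 517
```

Both results match the Gene Length and Protein Length fields of the record.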


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCACGA AGCGAGCTTC CGGCATTCTT CTCCATCCAT CTTCCTTGCC GTCGCGTGGC 
GGCATTGGCG ATTTCGGCCC CGCTGCTTAT GAATTCGTAA ATTGGCTGGC GGAGGGCAAG
CAAACGCTGT GGCAGATCCT GCCGCTCGGG CCTCCGGGGA TCGGGAACTC GCCGTATTCA
TCGACGTCGG CATTTGCGGG AAACATTGTG CTCATCAGCC TCGAGCGGCT GGCGGAGCGC
GGAATGCTCG ACAACGCTGC GCTGAAGAGC TTGCCGGAAA GCGATGGGAG CCGCGTTAAT
TTTGAGAACG TATTGAAGGT CAAACTCCCG CTGTTGCGAC AGGCGGCGGA GAGCTTTTTG
AAGAACGCTT ACGGAAATTC GCGCGAGCGG TTCAATCGCT TTTGCCGCGA CAACACCTGG
TGGCTCGACG ATTTCGTGCT CTTCGACGCT CTGCGCGAAC GTTACCAGGG AGCGAGCTGG
AATACGTGGC CGACCGAGAT TTCGCATTGC CAGCCGGAGG CAATTGCCAA GACGCGCAGC
GAACTGGCGC ACGAGTTGGA GGTCGCAAAG TTCCTGCAGT TTGCGTTCTT CGAGCAGTGG
GGCGCGCTGC GAAACTATTG TCATCAGCGG CGAATCCGGA TTGTGGGCGA CGTGGCGATA
TTCGTCAGCT ACGACAGCGC TGACGTCTGG ACGCACCGCG ACATCTTCCG TCTGCGCGAC
GTGGAACCCG AAGTCGTTGC CGGAGTGCCA CCGGATGCAT TCAGCGATAC CGGCCAGCGA
TGGGGGAATC CGCTATATGA CTGGAACCGC CTGCGCGAGC GCGGCTACGA CTGGTGGGTG
AGCCGCATGC GCTGGGCGCA CACCTGGTGC GACATCCTGC GCATCGATCA CTTCCGCGGC
TTTGAATCGT ATTGGGAGAT TCCGGCGGAC GAACCGACGG CGATCCACGG TAGCTGGGCG
AAAGGTCCCG CTGACGAGTT TTTCCACGTG ATCAACCGCG AACTCGGCGA GTTGCCATTC
ATTGCCGAAG ACCTGGGAAT GATCACTCCG GAAGTGCATC AATTGCGCGA GCGGCTGAAG
ATTCCCGGTA TGCGCGTGCT GCAGTTTGCG TTTGGCGATC GCGGCGCGCA CATGTACTTG
CCGCACCGCT ACGACACAAA CACCGTGGTC TATACCGGCA CCCACGACAA CGACACCACG
ATGGGTTGGT GGCAAGGTGA CGCCCAGCCG CATGAAAAGC GCGACGCCGC CGCAGCCTTT
GGCGCGAACG ACCAGAACGT GCATTGGGCG TTCATTCGCG CGGCGCAGAC ATCGCTGGCA
ACGCTGAGCG TGGTGCCGCT GCAGGACGTC TTCGGCCTTG ACAGTTCCGC GCGCATGAAT
ACGCCGAGCC TTTCCGACGG CAACTGGGGC TGGCGCTACA AGCGCGGGCT GCTTACGCAA
GATGCAGCGA AAACGCTGTC AGAACTAGCG GAGACGACGG ATCGCGACGA GTTGTTGTTA
TCAGGCGGAA ACCAGCAGGG CAACGGGGAA CGACCGGAAG ATTTCGCTGC ATAA
 
Protein sequence
MFTKRASGIL LHPSSLPSRG GIGDFGPAAY EFVNWLAEGK QTLWQILPLG PPGIGNSPYS 
STSAFAGNIV LISLERLAER GMLDNAALKS LPESDGSRVN FENVLKVKLP LLRQAAESFL
KNAYGNSRER FNRFCRDNTW WLDDFVLFDA LRERYQGASW NTWPTEISHC QPEAIAKTRS
ELAHELEVAK FLQFAFFEQW GALRNYCHQR RIRIVGDVAI FVSYDSADVW THRDIFRLRD
VEPEVVAGVP PDAFSDTGQR WGNPLYDWNR LRERGYDWWV SRMRWAHTWC DILRIDHFRG
FESYWEIPAD EPTAIHGSWA KGPADEFFHV INRELGELPF IAEDLGMITP EVHQLRERLK
IPGMRVLQFA FGDRGAHMYL PHRYDTNTVV YTGTHDNDTT MGWWQGDAQP HEKRDAAAAF
GANDQNVHWA FIRAAQTSLA TLSVVPLQDV FGLDSSARMN TPSLSDGNWG WRYKRGLLTQ
DAAKTLSELA ETTDRDELLL SGGNQQGNGE RPEDFAA
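
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence, and the GC content field can be recomputed from it. The sketch below is one possible way to do this in plain Python. It assumes only that translation table 11 assigns amino acids to codons in the same way as the standard code (the two tables differ in their permitted start codons), and that the input contains only the bases A, C, G and T. The helper names `clean`, `translate` and `gc_fraction` are illustrative.

```python
# Standard codon-to-amino-acid map, which translation table 11 shares with
# table 1. Codons are enumerated with the bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used for display, then upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGTTCACGA AGCGAGCTTC CGGCATTCTT"
print(translate(first_blocks))  # MFTKRASGIL
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the complete protein and the GC content field to be compared in the same way.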