Gene Acid345_3983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3983 
Symbol 
ID4072456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4711117 
End bp4714413 
Gene Length3297 bp 
Protein Length1098 aa 
Translation table11 
GC content58% 
IMG OID637986010 
Productalanyl-tRNA synthetase 
Protein accessionYP_593057 
Protein GI94971009 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0013] Alanyl-tRNA synthetase 
TIGRFAM ID[TIGR00344] alanine--tRNA ligase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGGGC GGTGCCTGAT AAAATACTCG TTTACGCCCA TGATGACAGG CTCACAAGTA 
CGCCGCCGCT TTCTGGATTT CTTTATCTCC AAGGGCCACA AAGAGGTCCA CTCTTCGTCG
CTGGTGCCCG CCAACGACCC GACGTTGCTG TTCACCAACG CGGGCATGAA CCAATTCAAA
GATGTCTTTC TCGGAGTCGA GAAGCGCGAC TACAACCGCG CTACTACCTC GCAGAAGTGC
GTCCGCGCGG GCGGCAAACA TAACGACCTG GAGAATGTTG GGTTCACCAA CCGTCACCAT
ACCTTCTTCG AGATGCTCGG CAACTTCAGC TTTGGCGACT ACTTCAAGAA GGACGCGATT
GCCTACGCAT GGGAGCTGGT GACATCGCCT GACTGGTATG GGATGCAGCA GGACAAGCTC
TACGCGACGA TTTTCAAGGG CGAGAACGGC GTGCCGCGCG ACGACGAGGC CTACCAGCTC
TGGCTCGACG TGGGTGTCCC CAAAGAGCGC GTCTTCGAAA TGGGGATGAA AGACAATTTC
TGGGCGATGG GCGACACCGG CCCTTGCGGT CCGTGCTCGG AGTTGCACTA CGACATGGGC
GTAGCTTCGT CCGACAAGAA GAACCCGGAG TGTGCCGCGG GGGAATGCAA GTTCCCGTGC
GAATGCGGGC GGTATGTCGA GATCTGGAAT TTGGTGTTCA TGCAGTTCGA CCGCGATGCC
AGCGGCACAC TGAATCCACT GCCGAAGCCT TCGATTGATA CCGGCGCCGG CCTGGAGCGC
GTGACGGCGG TGATGCAGGG TGTGGTGTCG AACTACGACA CGGATTTGTT TACGCCGCTG
ATTAAGCGCG CGGCGGAACT GACCGACGTT GACGAGAAGA GAGAAGAAGC GAAAGAGTCG
CAGTCGAAGA GCGCAGCCTC GTTGCGAGTA ATCGCCGACC ACTCGCGGGC GGCAACGTTC
CTGATCAGTG ATGGCGTGAT TCCGGCCAAC GAAGGCCGCG GCTACGTGCT GCGCAAGATC
ATTAGGCGCG GGATTCGCCA CGGACGTCTG CTCGGACAGA ACAAACCGTT CCTGCACGAC
ATGGTCTATG CGGTACGCGA CCTGATGCAG GATGCGTATC CGGAGCTGAA AGAGACAGCG
GACCGGGTGG CGAAGACGGT ACTGGCAGAA GAGACACGGT TTGCGAACAC ACTGGATATT
GGGCTGAAGA AACTGGATGA AGACCTCGCC TCATTGGCAG AGACGACTCG GGAATTGGGC
CGGCTTTCGT GGGAAGAGCC AAAGCCCGTC GAATTTGCGC TTAAGGAAGT TGCTCGCCAT
GTCTTTGTGG TCGCGCAGAT GCTCAGTCCG GGTATGACTT TTGACACGCC CGGTCATGGT
TTCTTACTAA GAAGAGGTAT TTACCGGGCC TGGATGGATT TGAAGGCTGT CGGGCTCGAC
AGAGTCGTGC GACTTTCCGA GATTGCAGAG GCGCTCTCGG AGGGCGAAAC CGGACGTGTT
CAAAGTGTTG CTAAGAACCT AGCACACACT AAGCAGATCC TAGACGCAGA GCAGGAACGA
CTGCAGAGTG CCTCATCTCA ACTCGAGTTC GAATTGGACG CTATGGAAGC TGAGAAAAAG
CTCGCAGAAA ATCCTGAAAT GCTCGCTGAG CAAGTGCGAG AGGTATTCGG ACCCGCTCTC
GAAGAGAAGG TAATCAATGA GCTGCGCGGC AAACTGCAAA GCCGTCCCGT CTACTCAGGC
GACAAAGCTT TCAAGCTTTA CGACACGTTC GGCTTGCCGC TCGATTTCAT GGTAGATGCC
GCACGCGACC AAGGGATCGA ATTCGATCAG GCGGGCTTCG ACGCGGCGAT GGAATCGCAG
CGCGAAACGG CACGCGCCTC TTGGAAGGGG GGATCGAAGT TAAGCGCCAG CCCTATTTAT
AGAGAACTTG CCGAGGAGCT TGGAACGAAG CCATATCCCG GTCAACTTGC TGGCGGACAT
CCAATGATCA CTCCACAGTC GATCTTTGTC GGATACACCA GTACTGAATT CAAGGGAGCC
ACGGTGCTGG CAATCATTTC GAATGGGGCA AGTGTCGCTC AGCTAGCTCC GGGAGATGTC
GCGGAGATGG TTCTCGATTA CACGCCGTTC TATTCGGAAT CCGGCGGACA GATTGGCGAC
ACTGGCTGGC TCTACGATTC CACGGGCAAT ACGGTTGTCG CGGAAGTCGA GACGGTACAG
TCTCCGGTGC AGGGTGTTCG TGCGCACAAG GTGACGGCGC GGCAGAACAT CGCCGTCGGC
GACAAGCTGA ATGCCGTCGT CAATGCCGAC GTCCGGCGCG CGACCATGCG CAATCACACC
GGTACGCATC TGCTGCATGC CGCGTTGCGC GAGGTATTGG GCAAGCACGT GAAGCAGGCG
GGATCGCTGG TGGATCCCGC GAAGCTGCGC TTTGATTTCT CGCACTTCAC CGGAGTGGCG
GATGAAGAAC TGCAGGACAT TGAAGACATC GTCAACAAGG AAGTGCTCAA GAACGATCGC
GTGGAAGTGA TCGAGAACGT GCCGATTGAC GTCGCGGTCA ACGAGTACAA AGCGATGGCA
CTCTTCGGCG AGAAGTACGG GGACCGGGTG CGGGTAATCA AGATCGGCGA TTTTTCGACC
GAGCTTTGCG GCGGCACCCA TACGCTGGCA ACCGGTGAGA TCGGCCTGAT CAAAGTGCTT
CACGAAGGCA GCGTGTCGAG TGGCGTGCGG CGTCTTGAGG CCGTTACCGG CGAAAATTCG
GTGCGGCACT TCCGCAAAGA TCACGAACTC GAAGGCGTGG TTTCGACGAT CGTACGTCCG
TCGGAAGGCC TCAGCCCGGC GCAGGCGTTG AAGTTCGAGC TTGACCGGCG CGAAGAAGAG
ATTAAGAAGC TGCGCAAAGA ACTCGAGCAG TCGAGGATGA AGTCGGCTTC GAGTGCGGTG
TCGTCGGCAA CGGAGAGCGC TCGCGAGGTA AAAGGCATCA AGGTGTTGGC CACGCGGGCT
GACAACATTG ACCGCAACCA GATGCGCACG CTAATCGATA ACCTGCGAAG CAAGCTTGGT
TCCGGTGTGA TTGTGTTGGG GTCAGTGCAG GATGGCAAGG TTGCGCTGAT CGTGGGTGTG
ACCAAGGACC TGACGTCGAA GATCCAGGCC GGCAAGATCA TCGCGCAGGT AGCAAAGCAC
GTTGGTGGCT CTGGGGGCGG GCGTCCGGAC ATGGCGGAAG CGGGCGGAAA AGACCCGGCG
GCGCTGGACG GTGCCTTAAA TGCCACCTAT GGCATTGTGG ACAGCTTGCT TTCGTAG
 
Protein sequence
MTGRCLIKYS FTPMMTGSQV RRRFLDFFIS KGHKEVHSSS LVPANDPTLL FTNAGMNQFK 
DVFLGVEKRD YNRATTSQKC VRAGGKHNDL ENVGFTNRHH TFFEMLGNFS FGDYFKKDAI
AYAWELVTSP DWYGMQQDKL YATIFKGENG VPRDDEAYQL WLDVGVPKER VFEMGMKDNF
WAMGDTGPCG PCSELHYDMG VASSDKKNPE CAAGECKFPC ECGRYVEIWN LVFMQFDRDA
SGTLNPLPKP SIDTGAGLER VTAVMQGVVS NYDTDLFTPL IKRAAELTDV DEKREEAKES
QSKSAASLRV IADHSRAATF LISDGVIPAN EGRGYVLRKI IRRGIRHGRL LGQNKPFLHD
MVYAVRDLMQ DAYPELKETA DRVAKTVLAE ETRFANTLDI GLKKLDEDLA SLAETTRELG
RLSWEEPKPV EFALKEVARH VFVVAQMLSP GMTFDTPGHG FLLRRGIYRA WMDLKAVGLD
RVVRLSEIAE ALSEGETGRV QSVAKNLAHT KQILDAEQER LQSASSQLEF ELDAMEAEKK
LAENPEMLAE QVREVFGPAL EEKVINELRG KLQSRPVYSG DKAFKLYDTF GLPLDFMVDA
ARDQGIEFDQ AGFDAAMESQ RETARASWKG GSKLSASPIY RELAEELGTK PYPGQLAGGH
PMITPQSIFV GYTSTEFKGA TVLAIISNGA SVAQLAPGDV AEMVLDYTPF YSESGGQIGD
TGWLYDSTGN TVVAEVETVQ SPVQGVRAHK VTARQNIAVG DKLNAVVNAD VRRATMRNHT
GTHLLHAALR EVLGKHVKQA GSLVDPAKLR FDFSHFTGVA DEELQDIEDI VNKEVLKNDR
VEVIENVPID VAVNEYKAMA LFGEKYGDRV RVIKIGDFST ELCGGTHTLA TGEIGLIKVL
HEGSVSSGVR RLEAVTGENS VRHFRKDHEL EGVVSTIVRP SEGLSPAQAL KFELDRREEE
IKKLRKELEQ SRMKSASSAV SSATESAREV KGIKVLATRA DNIDRNQMRT LIDNLRSKLG
SGVIVLGSVQ DGKVALIVGV TKDLTSKIQA GKIIAQVAKH VGGSGGGRPD MAEAGGKDPA
ALDGALNATY GIVDSLLS