Gene Acid345_3931 details


Gene Information

Locus tag: Acid345_3931
Symbol: tnaA
ID: 4071314
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4644695
End bp: 4646071
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 59%
IMG OID: 637985957
Product: tryptophanase
Protein accession: YP_593005
Protein GI: 94970957
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3033] Tryptophanase
TIGRFAM ID:
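
The gene and protein lengths in this record follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record; 1-based inclusive coordinates are an assumption.
start_bp = 4644695
end_bp = 4646071

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # codon count, including the terminal stop
protein_length = codons - 1           # the stop codon encodes no residue

print(gene_length, "bp")              # 1377 bp
print(protein_length, "aa")           # 458 aa
```

Both values agree with the Gene Length and Protein Length fields of the record.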


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0694288
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.220005
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCATCC GCACCATCAT TGAACCTTTC CGTATCAAGA GCGTCGAACC CATCCGCTGG 
ACCACGCGTA AAGAGCGCGA AGAGCTGTTA AAAAAGGCGT CGTATAACGT CTTTCTCCTC
GACGCCGAAG ATGTCCTGAT CGATCTCCTT ACCGACTCGG GCACGGGCGC CATGTCCACC
GCGCAGTGGG CCGCCGTGAT GCAGGGCGAC GAGAGCTACG CCGGAAGCCC CAGTTTCTAC
CGCTTCCGCG ATTCAGTGCA GGAGATTATG GGCTACAAGC ACGTCATCCC CACGCACCAG
GGTCGCGCCG CAGAGCGCAT CTTGTTCAGC GTAATGTGCA AGAAGGGCGA TGTCGTCCCT
AACAACACCC ACTTCGATAC CACGCGCGCC AACGTCGAAT TCACCGGCGC CGAAGCCGTA
GACCTCCTTC GCGAAGAAGG CCGCCATCCC GAAGTCATTC ACCCCTTCAA GGGCAACATG
GATGTCGAGG CGCTCGAAGC ACTCATCCAG AGAGTCGGCC GCGCGCGCAT TCCGCTGGTG
ATGCTGACCG TCACCAACAA CTCCGGCGGC GGCCAGCCCG TCTCCATGGA AAACGCTCGC
CAGGTCAGCG CCGTCTGCAA AAAATATGGA ATTCCACTCT ACTTTGACGC GTGCCGCTTC
GCCGAGAACT CCTACTTCAT TAAGCTGCGC GAGCCGGGCT ACGCCGAAAA AACGCCGAAA
GAAATTGCCC AGGAAATGTT TGCGCTCGGC GACGGCTGCA CCATGTCAGC CAAAAAAGAC
GGCATGGCGA ATATCGGCGG CTTCCTTTGT ACCAACGACG ACATCATCGC CCAGCAGGAG
AAGAACCTGC TCATTCTCAC CGAAGGGTAT CCAACTTATG GTGGTTTAGC GGGCCGCGAC
CTTGAGGCGA TTGCTGTAGG AATTCAGGAA GCACTGCACG AAGATTATTT GCGCTACCGC
ATCACTTCGA CCGGCTACCT CGGGCGCAAA CTCACGGAAG CTGGCGTTCC GATCGTGCAG
CCGCCGGGCG GCCATGCGAT CTATCTCGAC GCGCGCACGT TTTTGCCGCA CATCCCGCTC
AACCAATTTC CTGGCGTCGC ACTCACCTGT GAGCTCTATC TTGAAGGAGG CATCCGCGCC
GTCGAGATCG GTGGCCTGAT GTTCGGCAAG GCAGCCAAGA TGGACCTCGT GCGTCTCGCT
ATTCCGCGGC GCGTGTACAC GCAGAGCCAC ATTGATTATG TGATCGAAGT TCTGCTGGAA
GTGTGGAAAC GCCGCGAAGA TGTGGATGGC ATGGAGCTTC TCTACGAAGC GCCGTTCTTG
CGGCATTTTA CGGCACGACT CTGCCCCACC GCGGAAGTCG CTGCCGCAGC GCGGTAA
 
Protein sequence
MPIRTIIEPF RIKSVEPIRW TTRKEREELL KKASYNVFLL DAEDVLIDLL TDSGTGAMST 
AQWAAVMQGD ESYAGSPSFY RFRDSVQEIM GYKHVIPTHQ GRAAERILFS VMCKKGDVVP
NNTHFDTTRA NVEFTGAEAV DLLREEGRHP EVIHPFKGNM DVEALEALIQ RVGRARIPLV
MLTVTNNSGG GQPVSMENAR QVSAVCKKYG IPLYFDACRF AENSYFIKLR EPGYAEKTPK
EIAQEMFALG DGCTMSAKKD GMANIGGFLC TNDDIIAQQE KNLLILTEGY PTYGGLAGRD
LEAIAVGIQE ALHEDYLRYR ITSTGYLGRK LTEAGVPIVQ PPGGHAIYLD ARTFLPHIPL
NQFPGVALTC ELYLEGGIRA VEIGGLMFGK AAKMDLVRLA IPRRVYTQSH IDYVIEVLLE
VWKRREDVDG MELLYEAPFL RHFTARLCPT AEVAAAAR
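
The gene and protein sequences above can be cross-checked against each other with a small amount of code. The following is a minimal Python sketch, not the database's own method. It builds the codon-to-amino-acid mapping shared by the standard code and translation table 11, strips the 10-base block spacing used in this record, and translates until the first stop codon. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. The sketch ignores the alternative start codons that table 11 permits, which does not matter here because the gene begins with ATG.

```python
# Codon table in TCAG order; table 11 uses the same codon-to-residue
# assignments as the standard code (it differs only in permitted start codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGCCCATCC GCACCATCAT TGAACCTTTC"))  # MPIRTIIEPF
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field.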