Gene SeD_A4643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4643 
SymboltyrB 
ID6871279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4484762 
End bp4485955 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content56% 
IMG OID642787545 
Productaromatic amino acid aminotransferase 
Protein accessionYP_002218143 
Protein GI198244101 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1448] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.319396 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value0.236394 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTTCAAA AAGTTGACGC CTATGCCGGC GATCCGATTC TTTCACTGAT GGAGCGTTTT 
AAAGACGACT CCCGTCACGA CAAAGTAAAT CTGAGCATTG GCCTGTATTA CAACGAAGAC
GGGATTATCC CACAGCTCAA AACGGTGGCC GAAGCCGAAG CCCGGCTTAA CGCGCAGCCG
CATGGCGCTT CGCTGTACCT GCCGATGGAA GGACTCAATA CTTATCGCCA TACTATCGCG
CCTTTGCTCT TTGGCGCCGA TCACCCGGTT CTTCAGCAAC AGCGCGTGGC CACTATCCAG
ACATTAGGCG GCTCCGGCGC GCTGAAAGTA GGCGCGGATT TCCTGAAGCG TTATTTCCCC
GACGCAGGCG TATGGGTGAG CGACCCCACC TGGGAAAACC ATATTGCGAT ATTTGCCGGG
GCAGGATTCG AAGTGAGTAC TTACCCCTGG TATGACGACG CGACTAACGG CATCCGTTTT
AACGATCTGC TGGCCACGCT GAATACGTTA CCTGCGTGCA GTATCGTGCT GCTGCACCCC
TGTTGTCACA ACCCGACCGG GGCGGATTTA ACGCCTTCGC AATGGGATGC GGTGATTGAA
ATAGTGAAAG CGCGCGATCT GATTCCGTTC CTTGATATTG CCTATCAGGG GTTTGGCGCA
GGCATGGACG AGGATGCTTA CGCTATTCGC GCCATTGCCA GCGCCGGGTT ACCTGCGTTA
GTCAGTAATT CTTTTTCGAA GATTTTCTCG CTGTACGGCG AGCGCGTCGG CGGCCTGTCC
GTGGTGTGTG AAGATGCTGA AATCGCCGCG CGGGTTCTGG GACAGCTAAA AGCGACGGTG
CGCCGAATTT ACTCCAGTCC GCCGTGTTTC GGCGCTCAGG TGGTCGCTAC GGTCCTGGGC
GATGAGGCGT TAAAAGCGGG CTGGCTGGCG GAAGTCGACG CGATGCGTAA CCGCATTATA
TCGATGCGCC AGACGCTGGT GAAGGAGCTG AAGGCGGAGA TGCCTGACCG CAACTTTGAT
TACTTGTTAC AGCAGCGCGG TATGTTCAGC TATACCTGGT TAAGCGCGGA GCAGGTCGAG
CGGTTGCGCG ATGAGTTTGG CGTTTACCTG ATTGCCAGTG GCCGCATGTG TGTCGCCGGG
CTTAATGCTT CAAATGTACA CCGCGTGGCG AAGGCATTTG CCGCTGTCAT GTAA
 
Protein sequence
MFQKVDAYAG DPILSLMERF KDDSRHDKVN LSIGLYYNED GIIPQLKTVA EAEARLNAQP 
HGASLYLPME GLNTYRHTIA PLLFGADHPV LQQQRVATIQ TLGGSGALKV GADFLKRYFP
DAGVWVSDPT WENHIAIFAG AGFEVSTYPW YDDATNGIRF NDLLATLNTL PACSIVLLHP
CCHNPTGADL TPSQWDAVIE IVKARDLIPF LDIAYQGFGA GMDEDAYAIR AIASAGLPAL
VSNSFSKIFS LYGERVGGLS VVCEDAEIAA RVLGQLKATV RRIYSSPPCF GAQVVATVLG
DEALKAGWLA EVDAMRNRII SMRQTLVKEL KAEMPDRNFD YLLQQRGMFS YTWLSAEQVE
RLRDEFGVYL IASGRMCVAG LNASNVHRVA KAFAAVM