Gene SNSL254_A1031 details


Gene Information

Locus tag: SNSL254_A1031
Symbol: aspC
ID: 6486172
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 1048891
End bp: 1050081
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 55%
IMG OID: 642736437
Product: aromatic amino acid aminotransferase
Protein accession: YP_002040196
Protein GI: 194445730
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1448] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
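
The length fields in this record can be cross-checked against the coordinates. The short Python sketch below does that arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1048891
end_bp = 1050081

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1191 396
```

Both results agree with the Gene Length and Protein Length fields listed above.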


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.643394
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.0066077
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTTGAGA ACATAACCGC CGCTCCTGCC GACCCGATTT TGGGCCTGGC CGATCTGTTT 
CGCGCCGATG ACCGTCCAGG GAAAATTAAC CTTGGCATTG GCGTATATAA AGATGAAACC
GGCAAAACAC CGGTGCTCAC CAGCGTTAAA AAAGCCGAGC AGTATTTACT GGAAAATGAA
ACGACGAAAA ACTATCTCGG CATTGACGGG ATTCCGGAAT TTGCTCGCTG CACTCAGGAA
CTGCTGTTCG GCAAAGGCAG CGCGCTGATT AACGATAAGC GCGCACGCAC GGCGCAAACG
CCGGGCGGTA CCGGCGCGTT ACGCATTGCC GCCGACTTTT TAGCAAAAAA TACTCCGGTA
AAACGGGTGT GGGTGAGCAA CCCCAGCTGG CCGAACCACA AAAGCGTGTT TAACGCTGCC
GGCCTGGAAG TTCGGGAGTA CGCTTACTAC GATGCGGAAA ATCACACGCT GGATTTTGAA
GCGCTACAAG CCAGCCTGAG CGAAGCACAG GCCGGCGATG TGGTTCTGTT CCACGGTTGC
TGTCATAACC CAACCGGCAT TGACCCTACT CTGGAACAGT GGCAGGTTCT GGCTGAGCTT
TCCGTTGAAA AAGGGTGGCT GCCGCTATTC GATTTCGCTT ACCAGGGCTT TGCCCGCGGC
CTGGAAGAAG ATGCGGAAGG TCTGCGCGCC TTTGCCGCGC TGCATAAAGA GCTCATTGTC
GCCAGCTCCT ACTCCAAAAA CTTCGGCTTA TATAATGAGC GCGTCGGCGC CTGCACCCTC
GTGGCTGCCG ATGCGGAAAC CGTGGATCGC GCTTTTAGCC AGATGAAATC CGCCATTCGC
GCTAACTACT CCAACCCACC AGCGCACGGC GCGTCCATCG TCGCGACTAT CCTGAGTAAT
GACGCCCTGC GCGCTATCTG GGAACAGGAA CTGACGGACA TGCGCCAGCG TATTCAGCGT
ATGCGTCAAC TGTTCGTGAA TACCTTGCAG GAAAAAGGCG CGAACCGTGA CTTCAGCTTT
ATTATCAAGC AGAACGGGAT GTTCTCATTC AGCGGTCTGA CCAAAGATCA GGTGCTGCGT
CTGCGTGAAG AGTTTGGCGT TTATGCCGTG GCTTCTGGTC GCGTGAACGT GGCGGGCATG
ACCCCGGATA ATATGGCGCC GCTGTGCGAA GCCATTGTCG CGGTACTGTA A
 
Protein sequence
MFENITAAPA DPILGLADLF RADDRPGKIN LGIGVYKDET GKTPVLTSVK KAEQYLLENE 
TTKNYLGIDG IPEFARCTQE LLFGKGSALI NDKRARTAQT PGGTGALRIA ADFLAKNTPV
KRVWVSNPSW PNHKSVFNAA GLEVREYAYY DAENHTLDFE ALQASLSEAQ AGDVVLFHGC
CHNPTGIDPT LEQWQVLAEL SVEKGWLPLF DFAYQGFARG LEEDAEGLRA FAALHKELIV
ASSYSKNFGL YNERVGACTL VAADAETVDR AFSQMKSAIR ANYSNPPAHG ASIVATILSN
DALRAIWEQE LTDMRQRIQR MRQLFVNTLQ EKGANRDFSF IIKQNGMFSF SGLTKDQVLR
LREEFGVYAV ASGRVNVAGM TPDNMAPLCE AIVAVL
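
The relationship between the two sequences above can be reproduced with a small translation routine. The following is a minimal Python sketch, not the tool that produced this record. It builds the codon table used by translation table 11, whose amino acid assignments are the same as the standard genetic code (the tables differ in which start codons are permitted), strips the spaces used to group the sequence into blocks of ten, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the code does not handle ambiguous bases.

```python
# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace used for block formatting and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# The first three blocks of the gene sequence listed above.
print(translate("ATGTTTGAGA ACATAACCGC CGCTCCTGCC"))  # prints: MFENITAAPA
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.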