Gene Sare_4140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4140 
Symbol 
ID5705585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4705945 
End bp4707051 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content72% 
IMG OID641273568 
ProductN-succinyldiaminopimelate aminotransferase 
Protein accessionYP_001538921 
Protein GI159039668 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID[TIGR03539] succinyldiaminopimelate transaminase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.867426 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0592694 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CTGAACCGGC CCACGCCAGT CTCGACTCGG CTGCCCGAGT TCACCTGGGA CGCCCTGGAC 
GCCGCGGCGG CCGTGGCCGC GGCGCATCCA GGGGGTCTGA TCAACCTCTC CATCGGTACC
CCGGTCGATC CGGTACCGCC GGTGATCCGG GAGTCGCTGG CTGACGCGTC GGATGCTCCC
GGGTACCCCC GGACCGCCGG CTCGCCGGCG CTGCGGGCCG CGATCGCGGC TTGGGCGGCG
CGAACCTGCG GGGCTGAACC GGATCGGCTC GGCGTGCTGC CAGCGGTCGG CTCGAAGGAG
TTGGTGGCCT GGTTGCCGAC GCTGCTGGGG ATCGGGCCGC AGGACGTGGT CGTGGTGCCG
TCAGTCGCGT ATCCGACCTA CGAGGATGGG GCCCGACTCG CCGGGGCGAC GGTCGTGCGG
GCCGATTCGC TGACCGCCGT TGGCCCGAAC CCCCGGGTCC GCCTGGTCTG GGTGAATTCG
CCGGGAAACC CGACGGGCCG GGTGCTGCCC GCCGCGCACC TGCGCAAGGT CGTCGACTGG
GCCCGCGAGC GCGGCGCAGT TGTCGCCAGC GACGAGTGCT ACCTGCCGTT GGGCTGGGAG
ACCGAGCCGG TCTCGGTGCT GTCGCCGCAG GTGTGCGGTG ATTCGTACGA TCGTGTGCTG
GCAGTGCACT CGCTGTCGAA ACGCTCCAAC CTCGCCGGGT ATCGGGCCGG CTTCGTGGCC
GGCGACCCGG CGCTCGTCGC CGAGTTGCTC AAGGTGCGCA AGCACGCCGG GATGATCGTC
CCGGCGCCGG TGCAGGCCGC GATGGTAACC GCCCTTCGGG ACGAGCAACA CGCCACGGAG
CAGCGGGAGC GCTACCGGAA CCGGCGTGCT GTGCTGCGTG CCGCGTTCAC CGCCGCCGGG
TTCACCGTCG AGCATTCGGA GGCCGGACTC TACCTGTGGC TGACCCGGGA CGAGGACTGC
TGGGCGACGG TCGACTGGCT GGCCCGGCGG GGGATCCTGG TGGCGGCGGG TGCCCTGTAC
GGGCCGGCCG GCGCTCGGCA TGTTCGTGTG GCGCTGACCG AGACCGACCA ACATGTGGCG
GCGGTCGCCG ACCGGTTGGC CGACTGA
 
Protein sequence
MNRPTPVSTR LPEFTWDALD AAAAVAAAHP GGLINLSIGT PVDPVPPVIR ESLADASDAP 
GYPRTAGSPA LRAAIAAWAA RTCGAEPDRL GVLPAVGSKE LVAWLPTLLG IGPQDVVVVP
SVAYPTYEDG ARLAGATVVR ADSLTAVGPN PRVRLVWVNS PGNPTGRVLP AAHLRKVVDW
ARERGAVVAS DECYLPLGWE TEPVSVLSPQ VCGDSYDRVL AVHSLSKRSN LAGYRAGFVA
GDPALVAELL KVRKHAGMIV PAPVQAAMVT ALRDEQHATE QRERYRNRRA VLRAAFTAAG
FTVEHSEAGL YLWLTRDEDC WATVDWLARR GILVAAGALY GPAGARHVRV ALTETDQHVA
AVADRLAD