Gene Sde_1012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_1012 
Symbol 
ID3967766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp1293591 
End bp1294661 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content49% 
IMG OID637920079 
Producthistidinol phosphate aminotransferase 
Protein accessionYP_526486 
Protein GI90020659 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000723593 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGTTA TTAAAGCGCA TATTCGAGCT ATGTCGGCTT ACAAGCCGCC GTTGGATGGT 
CGTAATCCCG ACACCAATGT GTTGTTAGAT TTTAATGAGC GCACTTTGCC TGTAAGTGAA
AAAGTGCAGC GTGCGCTTAT TGATTACATA CAGAGTGGCC GTCTGCAAAT GTACCCTGCC
TACGGTAATA TTGTTGAGTT GATAGCGAAT TACGCTGGCG TAGGTGTCGA TCAGTTGATG
ATTACCAATG GTTCCGATCA GGGAATAGAG TTGGTGTTTC GTTCGGTGTG CGGTGCGGGC
GATAAGGTTG TTATTCCTGG GCCTAGTTTC GCCATGTACA GTCAGTGCGC AAAAATAGAA
AATTGCTCAA TTGTTTCGCC TCAATATACG CGCGAGCACG GTTTTCCGCT GCAGGAAGTG
TTGGATGCAA TAGAGCCTGA TGTAAAGGTG GTGGTGGTGT CTAACCCCAA CAACCCTTGT
GGTACCTTGT TGGCGCGCGA CGGTGTAGAG GCCATTCTGA AGGCCGCATC TACTCATGGT
AATAACGCTG CTGTGTTGGT GGATGAGTGT TACTTTGAAT ATACGCAGGC GACCGTTGCC
GATTTGGTGG CGGCCTATCC AAATTTAATT ATTACCCGTA CTTTTTCTAA AACGTGGGGC
ATTCCTTCGC TGCGCTTTGG CTACATTATT TCCTGTGCCG AGAATATTAA TGCGCTTTTG
AATGTGCGTG GACCTTACGA TATTAACCAG CTTGCTGTGG TGGCTGTGCG TGCCGCGTTG
GAAAATTTTT CGGATGTTTC CAGCTATATC GACGAGGTTA TGCTGCGCTC TAAGCCGGTA
TTGGAGGCGT TTTTGGATGA GCAGGGGATA GAGTATTGGC CTAGCTCCAC TAACTATATT
TGGACTTTTC CTGTTGGGCC TGAGGCGGTT GAGCTGGCGC TGCGGGCTGC GGGCATTCTG
GTGCGGCCTA AGGCGGATGC TGATGGGCGT TTGGGTTTAC GCATTACCTT GGGTACGCTC
GAGCAAACCA AGAAAGCTAT TGAGGTATTG CGGGTGGCTA CAGCTTCGTA G
 
Protein sequence
MSVIKAHIRA MSAYKPPLDG RNPDTNVLLD FNERTLPVSE KVQRALIDYI QSGRLQMYPA 
YGNIVELIAN YAGVGVDQLM ITNGSDQGIE LVFRSVCGAG DKVVIPGPSF AMYSQCAKIE
NCSIVSPQYT REHGFPLQEV LDAIEPDVKV VVVSNPNNPC GTLLARDGVE AILKAASTHG
NNAAVLVDEC YFEYTQATVA DLVAAYPNLI ITRTFSKTWG IPSLRFGYII SCAENINALL
NVRGPYDINQ LAVVAVRAAL ENFSDVSSYI DEVMLRSKPV LEAFLDEQGI EYWPSSTNYI
WTFPVGPEAV ELALRAAGIL VRPKADADGR LGLRITLGTL EQTKKAIEVL RVATAS