Gene Sde_2039 details


Gene Information

Locus tag: Sde_2039
Symbol:
ID: 3967398
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 2566486
End bp: 2567700
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 48%
IMG OID: 637921127
Product: amidophosphoribosyl transferase
Protein accession: YP_527511
Protein GI: 90021684
COG category: [R] General function prediction only
COG ID: [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes
TIGRFAM ID:
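
The coordinates and lengths listed above agree with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2566486
end_bp = 2567700

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1215
print(protein_length_aa)  # 404
```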


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTACG AATTAGCGTA TGTTTTTGCC GCTAGCCCCA CTGGCGAGCA GCACTTTGCG 
GGCACGCTCG AATCCAAGAC TGGCACAGGC ACGTTCACTT ATGCCGCTAG CTGGTTGGCA
AACGATTGGG CCTACCCACT AGACCCATTA AATCTACCCC TTTCAACTAA ACGTTACCGA
GCACTTAACA AGCACGGCTT ATTTGGTGTG TTTTGTGATG CTGCGCCAGA TGATTGGGGA
ACGCGTATTA TGCTGCTGCG CCACGAGCAC GCGCCTGCAA ACGAACTCGA GCGGTTAATT
CGCACAAGCG GAGGTGGGGT TGGCTGCCTG CGTTATAGCT TGTCGCGCGG GCAAGCTAAA
GTGCCGGCGC CATTGCCCAC AATGCAGCAC TTACACGATT TAGCTAAAGC TGCAGAGAAG
CTAGAACTAA AACACAAACT TGCCCCCGAA GAATTAGCCT TGCTCGAGCC TGGGTCATCT
ATGGGCGGCG CACGGCCAAA GGTTACCGTC GCCGATGATG AAACTCGTTG GTTAGTTAAG
TTCGCAAAAG CATGGGACCT AGTGGATGTT CCCTTGCTGG AATACTGCAG TATGCAGTTT
TTAAAAGATG TACTTGGCTT GAACGTACCA GAAACTCGAT TGATTAATGT GGGGGGTAAA
AACGCGTTTG CGATTGTTCG ATTTGATGGT GTGGCTAATT CACCAAAGCA TTTTATTTCT
GCCAACAGTT TGTTTAACCA AGATCGTATT CGCCCCATCG AAGATTCAAA GCGTAACCCA
TATTCCTATT GCAACTTGGC CAGCATTATT CGCAAGCACT GCTCGAATTT TAAAGAGGAT
AATAAAGAGC TGTTTATGCG TATGGTTGCC AACATTGTAA TGGGGAATAC CGATGACCAT
GCGCGTAATC ATGCTTTGTT ATTTGATATT ACTACCGCTA AGTGGCAGCT TTCACCGGCT
TACGACATGC TGCCCATAGT GGCTACACGC AGCGGTTTAC AAGCTATGGG AGTTGGTGAG
CATGGAGCGA AAGCTACGTT AGAAAACGCT TTAAGTTACG CAAATTTATT TGGTTTAAAA
ACTGCAGAAG CACAAGCTCT GTGTCAAAAA GTTAGTAACG CCTACACTGG CTGGCCAGAC
TATTGTTTGC AAAACGGTAT GCAAGCAGCG GATGTGCAAC TCGTGCAAGG CGCTATGCTT
AATGAGTTGA ATTGA
 
Protein sequence
MAYELAYVFA ASPTGEQHFA GTLESKTGTG TFTYAASWLA NDWAYPLDPL NLPLSTKRYR 
ALNKHGLFGV FCDAAPDDWG TRIMLLRHEH APANELERLI RTSGGGVGCL RYSLSRGQAK
VPAPLPTMQH LHDLAKAAEK LELKHKLAPE ELALLEPGSS MGGARPKVTV ADDETRWLVK
FAKAWDLVDV PLLEYCSMQF LKDVLGLNVP ETRLINVGGK NAFAIVRFDG VANSPKHFIS
ANSLFNQDRI RPIEDSKRNP YSYCNLASII RKHCSNFKED NKELFMRMVA NIVMGNTDDH
ARNHALLFDI TTAKWQLSPA YDMLPIVATR SGLQAMGVGE HGAKATLENA LSYANLFGLK
TAEAQALCQK VSNAYTGWPD YCLQNGMQAA DVQLVQGAML NELN
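
The protein sequence follows from the gene sequence under translation table 11. As a minimal sketch, the code below builds a codon lookup from the amino-acid string that NCBI publishes for table 11 and translates the first and last blocks of the gene sequence shown above. The names `CODON_TABLE` and `translate` are illustrative helpers, not part of any library. The sketch ignores table 11's alternative start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence, copied from the record.
first_block = "ATGGCTTACG AATTAGCGTA TGTTTTTGCC GCTAGCCCCA CTGGCGAGCA GCACTTTGCG"
last_block = "AATGAGTTGA ATTGA"

print(translate(first_block))  # MAYELAYVFAASPTGEQHFA
print(translate(last_block))   # NELN
```

Both results match the start and end of the protein sequence listed above. The last block can be translated on its own because it begins at base 1201, which is the first position of a codon.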