Gene Sde_2231 details


Gene Information

Locus tag: Sde_2231
Symbol:
ID: 3964835
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 2835184
End bp: 2836482
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 45%
IMG OID: 637921322
Product: putative aminopeptidase 2
Protein accession: YP_527703
Protein GI: 90021876
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1362] Aspartyl aminopeptidase
TIGRFAM ID:
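
The coordinates and lengths above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed sequence does end in TAA). The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2835184, 2836482)
print(length)                     # 1299
print(protein_length_aa(length))  # 432
```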


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.512559
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000110487
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACAGGCA ATACATTTAA TAATCAAGAT TTAATTCAGT TTTTACAGGA CTCGCCCACA 
CCTTTTCATG CTGTTTTGAG TATGTCTCAG CGTATGCAGG CAGCGGGGTT TGTTGAGCTG
AATGAAAACG AAGACTGGTC GTTACAGCCT GGTGGAAAAT ATTTTGTGGT GCGCTCGGGC
ACTGCCATTG CTGCATTTAT TCATGGCGTA GAATCTAGTG TAGATCACGG CATACGAGTT
GTTGGGGCAC ATACAGATAG CCCTTGTTTG AAAGTAAAGC CAATGCCTTC TAAATTGAGC
AATGGCTATC AGCAACTTAG TATTGAAGTG TATGGGGGCG TACTGTTAGC GCCGTGGTTC
GATCGAGATT TGTCACTTGC TGGCCGTGTT GTTTATCGCG ATACAAGTGG TGCGCTTAAG
TCTGCGCTTA TCAATTTTAA GCGTGCAATT GGCAGCGTGC CGAGCTTAGC CATTCACTTG
GACAGAGCTG CTAACGAAGG CAGAAAAATT AACCCCCATG TAGAAATGGA TGTTGTACTC
GGTCAAAGTG CGCAAAAGCT GGATTTTAAA CTGTTATTAC AAGAGCAGAT GCAGCTTGAA
GGGTATGACA ATATTGCTGA GATTTTAGAT TTTAACCTGA GCTTTTATGA TGTTCAGCCA
CCGGCGGTAT TAGGGTTAAA CAACGAGTTT CTAGCCAGTG CTAGATTGGA CAACCTGTTG
AGTTGCTATA TAGGAATAAA CAGCTTAATC CAAGCTGATA CCACCTATAC CAGTGTTGTT
ATTTGTAATG ACCACGAAGA GGTTGGCAGT CGCTCTGAGG TAGGGGCGCA AGGGCCCATG
TTAAAAGATG TGTTAAGTCG CATTAATCCA GACCCCCAAG CCAATCAAAA AGCAATTCGC
CGTTCGTTAA TGTTATCGGT CGACAATGCC CACGGTATTC ACCCGAACTA CGCGAGTAAG
CATGACGAGA ACCACGGCCC AATCATTAAC GCCGGCCCTG TGATTAAGTT CGACGCCTGT
CAAGGCTATG CCACCAATAG TGATAGCGCC GCTTTTGTAC GCTGGTTGGC CACCAAAGGG
GAGCCTATTG CTTTGCAATC CTTTGTAATG CGAGCAGATA TGCGTTGTGG CAGCACTATT
GGCCCAATCA CTGCGACAGA GTTGGGTATT CAAACAGTGG ATATTGGCCT TGCAACTTTT
GGTATGCATT CGGTAAGAGA GCTTGGTGGC GTTAAAGACG GTGAGCAGCT TAATAATTTG
CTTATGCGGT TTATGAGCAC AGAGACCTTA AAGTTATAA
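
A GC content figure like the one reported for this gene can be recomputed from a sequence printed in this 10-base block layout. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases. The function name is hypothetical, and the database's own rounding rule is not known.

```python
def gc_content_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence: 5 of its 10 bases are G or C.
print(gc_content_percent("ATGACAGGCA"))  # 50.0
```

Passing the full gene sequence, with its spaces and line breaks left in, should give a value that can be compared with the reported GC content.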
 
Protein sequence
MTGNTFNNQD LIQFLQDSPT PFHAVLSMSQ RMQAAGFVEL NENEDWSLQP GGKYFVVRSG 
TAIAAFIHGV ESSVDHGIRV VGAHTDSPCL KVKPMPSKLS NGYQQLSIEV YGGVLLAPWF
DRDLSLAGRV VYRDTSGALK SALINFKRAI GSVPSLAIHL DRAANEGRKI NPHVEMDVVL
GQSAQKLDFK LLLQEQMQLE GYDNIAEILD FNLSFYDVQP PAVLGLNNEF LASARLDNLL
SCYIGINSLI QADTTYTSVV ICNDHEEVGS RSEVGAQGPM LKDVLSRINP DPQANQKAIR
RSLMLSVDNA HGIHPNYASK HDENHGPIIN AGPVIKFDAC QGYATNSDSA AFVRWLATKG
EPIALQSFVM RADMRCGSTI GPITATELGI QTVDIGLATF GMHSVRELGG VKDGEQLNNL
LMRFMSTETL KL
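
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the database's own method. It relies on the fact that translation table 11 uses the same codon-to-residue assignments as the standard genetic code, and it does not handle alternative start codons, which is harmless here because the gene starts with ATG. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 shares them.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first in-frame stop codon."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(bases), 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        residues.append(residue)
    return "".join(residues)


# First three 10-base blocks of the gene sequence.
print(translate_cds("ATGACAGGCA ATACATTTAA TAATCAAGAT"))  # MTGNTFNNQD
# Last three codons of the gene sequence, including the TAA stop.
print(translate_cds("AAGTTATAA"))  # KL
```

Both fragments agree with the start and end of the protein sequence listed above.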