Gene Sde_2067 details

Gene Information

Locus tag: Sde_2067
Symbol:
ID: 3967451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 2645269
End bp: 2647320
Gene Length: 2052 bp
Protein Length: 683 aa
Translation table: 11
GC content: 45%
IMG OID: 637921157
Product: hypothetical protein
Protein accession: YP_527539
Protein GI: 90021712
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
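
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the reported gene length includes the stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2645269
end_bp = 2647320
gene_length_bp = 2052
protein_length_aa = 683

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, so the protein
# has one residue fewer than the number of codons.
codons, remainder = divmod(span_bp, 3)
coding_codons = codons - 1

print(span_bp, remainder, coding_codons)  # prints: 2052 0 683
```

Under these assumptions the span matches the reported gene length, the length is a whole number of codons, and the codon count minus the stop codon matches the reported protein length.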


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.312074
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.672104
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTAGCC AAATAGCCAT TGTTAAAAAC CAAATATCTC GCGACGCATT AGCGTGGTGC 
TTGGCAGCAC AAAGTATTTG TATTCTACCT TTGTTTTTCT ATTTTCCATT TTGGGTGCCG
ATTTTGTGGT TGGTTGCCGC AGCGTGGCGA GTGCAAATTT ACCGCGGCCA CTGGGGTTAC
CCACCTTCTT TTGTGAAACT TCTTTTAGGG CTTAGTTGTA TTGCTGGGCT TTGGGTTACT
TATCGAGGTG GGTTTGGGGT AGAGCCGGTT GTTGCATTTT TAGTGTGCGC ATTTGTTTTA
AAACTATTAG AAGCGCGCAC TAAAAAAGAT GTGGTTATCG TTTTATACGT TTGTTTTGTG
GCTGTAGCTG CCCAGTTACT TTTTAATCAA ACTATTATTG CCGCTTTCTA TGCCATTATT
TCGTTAGTTG TAATAATTGC AGCATTTACT GCAGTGTTTA GCCTTGTGGC AAACCGTATT
AAACACAGGC TTTTTTATGC CAGCAAATTA ACCTTGCAGG CGCTACCTTT AGCCGTTGTG
TTATTTCTTG TGTTGCCAAG GTTGGGGCAG TTATGGGCGG TGCCATTAAA TAATGCTAGT
GGCACAACTG GGTTCAGTGA CTCGATGGCC CCTGGTGATA TTAGCAATTT GATTCGCTCA
AGCGCGGTCG CTTTTCGTGC TTCGTTTACT AGTAGTAACG AAGATGTTGC CGAACCTGTT
ATACCCAAAC AGCAAGATTT ATATTGGCGC GGTTTAGTGC TGGAAGATTA CGACGGTCGC
CGTTGGAAGC GAAACCGGTA TCAGCAGGCT GCAGATTTCG CAAGATCCTC AAACAAAGAG
GTAACGAGTG AGGTTGTAAC CGCAGGCGAA TATGTAGAAT ATTCTATTTT AATGGAACCC
CACCAGCAGC GTTGGTTGTT TACCCTTATG GCTCCAGAGG TGCTATTAAA TAGCACGGTA
AGTATAGGTT TTAAACCTCA GGCATTGCTT TCCGCTAATC TACCAGTAAC TCAGCGTATT
CAATACCAAA TCCGATCCGC TCAAAAATAT AGTTTTCAAA CAGAAGGCTT AAGGCGGGAT
GAGCTTAAAC GATCTCTTCA GGTGCCGGAT GAGCTAAACC CTCGAACTAA AGAATTGGTT
GAAAGTTGGT TGGCAAGTGG GTTAACTCAG CAACAAATTA TAGAAAAAGC CCTGGATTTA
TATAGAGAAA GTTTTTACTA CACGTTAAAC CCACCGCCAT TAGGCAGTGA TGCGATTGAT
GAGTTTTTGT TTATAACGCG CCGTGGTTTT TGTGAGCATT TTAGTAGTAG CTTTACCTAT
ATGATGCGCT TGGCCAATAT TCCCGCGCGC GTAGTGGTTG GTTATCAGGG GGGCGAGCAA
AATAGCATAG AAAACTATTT ACTTGTGCGG CAATCAGATG CGCACGCGTG GGCTGAGGTT
TGGTTAGAAG GGAAAGGCTG GCAGCGAATT GACCCCACCC ATGTGGTCGC TCCAAATCGT
ATAGAGCAAG GCCTTTTTAA TTCTGTAGAA GAAAGTGAGG CCGCGCAACT TGAGGGCCGC
TTTGTTAAGC AGTTTGCGCT ATTGAATAGA CTGCAAATGC GATGGGATTC TATAAATTAC
TTATGGCATT CTCAGGTTTT GGCCTACGAC AGCAATGCTC AAAAAAGTTT GTTTAAGCGC
TTATTGGGTG GCACGGAATA TTGGCGTATA GCCCTAGCAC TATTTGGCAG TGTGGCAGTG
TTGTTATGTA TGTATTTTAT TGCTCCCATG CTGCAAAAGG GGGCTAGTCA GCCTGCGGAA
GTGAAGCTGT TCAATATCTT TGAAAAAAAG CTTTGTAAAT TGGGGTATAC ACGTTTAAAA
GGTGAGACTG CGGCGCAATT TGTTGAGCGC ATTGCAAAGG CTAACCCCGA GTTGGCCGAT
CCATTGCTCA AAGTGCGGGA TTCATTTTAT GCTGCGGCCT ATCAAAGCTC GGCAGAAAAT
ATCGCTGTTG GAGTGAAGCG GCTTGCAGCA GTGCTGAAGG TATTTCCCTA CAGTAAGGTG
AGCCCGCCAT AA
 
Protein sequence
MTSQIAIVKN QISRDALAWC LAAQSICILP LFFYFPFWVP ILWLVAAAWR VQIYRGHWGY 
PPSFVKLLLG LSCIAGLWVT YRGGFGVEPV VAFLVCAFVL KLLEARTKKD VVIVLYVCFV
AVAAQLLFNQ TIIAAFYAII SLVVIIAAFT AVFSLVANRI KHRLFYASKL TLQALPLAVV
LFLVLPRLGQ LWAVPLNNAS GTTGFSDSMA PGDISNLIRS SAVAFRASFT SSNEDVAEPV
IPKQQDLYWR GLVLEDYDGR RWKRNRYQQA ADFARSSNKE VTSEVVTAGE YVEYSILMEP
HQQRWLFTLM APEVLLNSTV SIGFKPQALL SANLPVTQRI QYQIRSAQKY SFQTEGLRRD
ELKRSLQVPD ELNPRTKELV ESWLASGLTQ QQIIEKALDL YRESFYYTLN PPPLGSDAID
EFLFITRRGF CEHFSSSFTY MMRLANIPAR VVVGYQGGEQ NSIENYLLVR QSDAHAWAEV
WLEGKGWQRI DPTHVVAPNR IEQGLFNSVE ESEAAQLEGR FVKQFALLNR LQMRWDSINY
LWHSQVLAYD SNAQKSLFKR LLGGTEYWRI ALALFGSVAV LLCMYFIAPM LQKGASQPAE
VKLFNIFEKK LCKLGYTRLK GETAAQFVER IAKANPELAD PLLKVRDSFY AAAYQSSAEN
IAVGVKRLAA VLKVFPYSKV SPP
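
The relationship between the two sequence blocks can be illustrated by translating a row of the gene sequence with translation table 11, the table this record lists. The sketch below is a minimal, dependency-free illustration rather than the database's own method: the `translate` helper is hypothetical, the codon assignments and start-codon set follow the NCBI bacterial table, and an alternative start codon such as GTG in the first position is read as methionine.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by table 11; each is read as Met at position one.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above, in its 10-base block layout.
first_row = "GTGACTAGCC AAATAGCCAT TGTTAAAAAC CAAATATCTC GCGACGCATT AGCGTGGTGC"
print(translate(first_row))  # prints: MTSQIAIVKNQISRDALAWC
```

The output corresponds to the first two 10-residue blocks of the protein sequence, MTSQIAIVKN QISRDALAWC. Applying the same helper to the final row of the gene sequence, AGCCCGCCAT AA, yields SPP, the last three residues of the protein.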