Gene Sde_1806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_1806 
Symbol 
ID3966751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp2299179 
End bp2300279 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content48% 
IMG OID637920889 
Producthypothetical protein 
Protein accessionYP_527278 
Protein GI90021451 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATGG TAAAAAACGC GCTTGCAAAG CCAGTTTGTA TTCAGCGCTC GTGCGTCTTA 
AAACGGTTTA TCCAATTTGT TAGTTGGTTG CTGTTAGGTA TTGGCGTTAT TGCGTGTGAC
AGTGACAAGC CTAGCGACAA TAATGCCATG CTCGGTGCGC AAGCTCATAT CCAGCAACTA
CAGTTAAGCA ATCAGATATC AACGTTTACC ATTACACCGC AAATAGGTGG GCGCGGCCTG
CATTTTGGTT TGGTGGGAGC TGACAATGTA CTTAAGGTAA ACGAGCGGTT GCTGAGTTTA
CCTGCACCAA AAGTATCATC TAGCTCAGAT AATATTGGCT ATTTAGGGCA CATAAATTGG
ATTGGGCCGC AGGCTGAGTG GTGGTTGCAT CAAACAGAAA ATCTAGAGCG CCGCCAACAA
AAGGCAGTAT GGCCGCCAGA TGCCCATACT GTGCTGGCTA GCGCAACGCT AAATGCTATA
TCAGGCAACG CGGTGACTAT GACTTTGCCC GCAAGCCCAG TTACGGGCTT GAGGTTGGAT
AAATCCTATG GGTTGCACGA CGATGGCAGC CTGCAATTAG ATGTAACAGC GACTAATACA
CGGCAAGCCA GTGTGGCGTG GGATATTTGG TTTAACACCC GTCTAAATGC TAATAGCGTG
TTATACGTAC CTGTTGCTGG GGCCCAAAAT GTGCGTATTG ATACGTTTGG GGAAACTCCA
TTTAAGCAAT CGGTTGTTGT TGACGAGGGG ATGTTAACCA TCGATTTAAC AGTTGCCGAT
AGGCTTAAAG GTAAAGCGTT TGTGCAGCCA AGTAAGGGTT GGATGGCCGC ATTCACGGCT
GATCAATTAT TTGTTATTGA GTTTACGCTT CAGCCACAAG CGGTAATTCA CCCAGCCCAA
GGGCAGTTGG AATTTTACCT GGATTACAGT GCAAAGAGCG TGGATGCTGG CTTGCTAGAG
ATGGAGCTGC ATAGCCCTTA CACGCACCTT GAACCGGGGG AGTCGTTCTC AGCACAAGAA
GTTTGGCGTG TTTATAGCTA CGAAGGGCCA AACGCTGCGC TTTATCACCG CAAGCAGCTA
GCGTTGCTGG GGTATAAGTA G
 
Protein sequence
MAMVKNALAK PVCIQRSCVL KRFIQFVSWL LLGIGVIACD SDKPSDNNAM LGAQAHIQQL 
QLSNQISTFT ITPQIGGRGL HFGLVGADNV LKVNERLLSL PAPKVSSSSD NIGYLGHINW
IGPQAEWWLH QTENLERRQQ KAVWPPDAHT VLASATLNAI SGNAVTMTLP ASPVTGLRLD
KSYGLHDDGS LQLDVTATNT RQASVAWDIW FNTRLNANSV LYVPVAGAQN VRIDTFGETP
FKQSVVVDEG MLTIDLTVAD RLKGKAFVQP SKGWMAAFTA DQLFVIEFTL QPQAVIHPAQ
GQLEFYLDYS AKSVDAGLLE MELHSPYTHL EPGESFSAQE VWRVYSYEGP NAALYHRKQL
ALLGYK