Gene Sde_1523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_1523 
Symbol 
ID3965051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp1964450 
End bp1965859 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content46% 
IMG OID637920601 
Producthypothetical protein 
Protein accessionYP_526997 
Protein GI90021170 
COG category[M] Cell wall/membrane/envelope biogenesis
[S] Function unknown 
COG ID[COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins
[COG3455] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03349] type IV / VI secretion system protein, DotU family
[TIGR03350] type VI secretion system OmpA/MotB family protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.414466 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.78534 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAACT CGGATATAAC AGTAATTAAG CCTCGACCAG GAAAGCGAGC AAACGCGCCC 
CAACAAAGCG TAACTCCGTC GTCGGATCAG ACCATAATAA AGCCGCGCCC TCGCCCCGGC
GGCAAGCGCA CACCTCCCAC TGCGGCAATG GGCGATACGA CTGTTATAAA GCCTCGCCAC
CCAAACCAAT CGGCAACTGT AGACTTAAAA TTCGCAGCCA TAGAGAACAC ACACGCAACA
ACAATTGCCG AGGCGGCATC CCCCGTTTTG GCGCTTGCGA CTCAACTAAA GCAAATTCAA
GGTAAAGTAG ATGGCCAGCG CCTACGAAAT TATGTGCAAG AGTCCATTAA ACAGTTTGAT
AGCAAAATAA CCAAACTTGA AAGCGATATT CAAATAAGGC AAGACGCCAA CTATATACTG
TGCGCTCTTA TTGACGAGAC CATTCTAAAT ACCACTTGGG GTGAGATGTC TGGCTGGAGC
CAACACCCTA TACTCAGTAT TTTTCACAAA GAAACCTACG GTGGAGAAAA ATTCTACCGC
ATATTAGATA CAAGCTTAGA ATCCCCTTAC GAGCATAAAG ATTTACTAGA AATACTTTTT
GTTGCAATGT CACTTGGTTT TATGGGAAAG CTGAGAATAG ACCCTCAAGG CCCTATAAAA
ATTGAAAAAA TCCGTAGTCG TCTTTACGAC GTACTACATA GATCCAGAGA GAAATACAAC
AACACCCTAT CGGTTAATAT TTCTCCGCAA ATCAGTAATA AGCAACACCT ATACTCATTT
TTACCTGCTT GGCTGTTAAT CGGCACACTT GTGCTCGCCG CATTTGGCCT ATACAGCTAT
TGGCTTATCG GCTTAAACAA AGAGTCGGAC ACAACGCGCG TACTAATGAC TAACCTAATT
CCCACGCCCG AGAAAAAAGT ACTCGACCCC AGTATGGTTC GGCCTGAATT TATCGAGCTG
CGCGCTTTAC TTACCCCAGA AATAGACAGA GGTATTTTAA GTGTGCAGGA CTACCCGACT
CACACCTCGA TAGTGCTGCA CAACCAAGAG CTTTTCACCT CTGGCAATAT CAATATCACA
CCCTCTTTTG AGCCAATATT AGATAAAATA GCCAAGGCAC TGGAAGCCAT ACCGGGCAGA
ATTATCGTAT CTGGCCATAC CGATAACGAA TCTATTCGCA CACCGCGCTA CCCTTCTAAC
TGGCACCTCT CCCTTGCTCG CGCAAGTGAA GTGGTTAAGT ACTTAGCTGC AAGTGCAGAC
CTTAAATCAC GCCTACTACC TGAAGGACGC GGGGCTAACG AACCCATTAT GAGTAACGAC
ACCGCCGCTG GGCGCGCCCA CAATCGCCGT GTTGTTATCG ACGTTTATTA CCATCAAGGA
TTAATAGCTA GCGAACAGCA AGCTAAATAG
 
Protein sequence
MDNSDITVIK PRPGKRANAP QQSVTPSSDQ TIIKPRPRPG GKRTPPTAAM GDTTVIKPRH 
PNQSATVDLK FAAIENTHAT TIAEAASPVL ALATQLKQIQ GKVDGQRLRN YVQESIKQFD
SKITKLESDI QIRQDANYIL CALIDETILN TTWGEMSGWS QHPILSIFHK ETYGGEKFYR
ILDTSLESPY EHKDLLEILF VAMSLGFMGK LRIDPQGPIK IEKIRSRLYD VLHRSREKYN
NTLSVNISPQ ISNKQHLYSF LPAWLLIGTL VLAAFGLYSY WLIGLNKESD TTRVLMTNLI
PTPEKKVLDP SMVRPEFIEL RALLTPEIDR GILSVQDYPT HTSIVLHNQE LFTSGNINIT
PSFEPILDKI AKALEAIPGR IIVSGHTDNE SIRTPRYPSN WHLSLARASE VVKYLAASAD
LKSRLLPEGR GANEPIMSND TAAGRAHNRR VVIDVYYHQG LIASEQQAK