Gene Sden_3686 details


Gene Information

Locus tag: Sden_3686
Symbol:
ID: 4020243
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella denitrificans OS217
Kingdom: Bacteria
Replicon accession: NC_007954
Strand:
Start bp: 4419337
End bp: 4421100
Gene Length: 1764 bp
Protein Length: 587 aa
Translation table: 11
GC content: 47%
IMG OID: 637957745
Product: triple helix repeat-containing collagen
Protein accession: YP_564682
Protein GI: 91795031
COG category:
COG ID:
TIGRFAM ID:
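
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length(4419337, 4421100)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.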


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATA AAATAGCACT CGCTATGGGA TTAATAGTTG TGGGTATAAG CCCACCTCTA 
TTTGCTGATA GCATGAAAAC TAGCTATTCG TTTAATCAAG CTAACGGTTA TCCGGGTAAT
GGTGAGCAAA ATAGCAATGG TAATAAAGGT TACTTCCCAT TATCGGGTAA TGGTCAGTTT
ATTACAGCAG GCCGTTCAGG GCTAGCGTTA GCCCAGGATG TGATGGTTGT TGGCGGATTA
GCCCCCTTTA ATAAAGATAC ACCGTTAACG GTTTCTTTAT GGATAAAGCC TACTCAAGAT
GCCGCTGGTG GCACCATTTT ATCCAGAAAA CACAGCGAAA ATGGCCAAGG CTTTAGCGTC
GCATTTGATT TTGATATGAA TCTGGAATTC ACACTCGTCG ATGAATTTGG CGGCTCAATT
TCAGTGGCCA CCTTAACCCC AGTGAATGTC GATCAAATGT GGCATCATGT TGCCGTTTCT
TATAAGGGTG ATGCGAATGC ATCGAATATG GCCCTATATA TCGATGGTCA AGTGACTGAA
TTAAAGCTGT CTTCAAATAG TTTGACCGGC CAAGTTGAGA CCTACCATCC CTTAGTCATA
GGTGGCAGCT CGTCATATAC CCAAGCACTC GCTGCAGAAA TCGATGAAGT GTATTTAGTG
CCTCAGAACT TTAATGCTGA GCAAGTGACG TGTCTTTATC AATTGAAAAC GGATTGTGCT
TACAGGCCAA CCACAGGCAA AGAGGGCCCT CGAGGCCCCA TAGGTGAGCT TGGGGAGCAG
GGTGATAGAG GTGCAACCGG AGTGAGCGGC CTAGCGGGCG ATGTGGGCCT CAAGGGTAAC
GCGGGTTTAC CAGGGCCAGT AGGCCCACAA GGTCCTAAAG GGCCTCAAGG TTTCACGGGA
CCTACTGGTT TAGCCGGAAT AGATGGCTCA GATGGTATTG ATGGCTCCGA TGGCACCAAT
GGTGCACCTG GTGCTCAAGG TGCTGTGGGT GATAGCGGCA TTCAAGGGCC ACAAGGTTCA
AAGGGATTGC AAGGAAACGT TGGGCCTAAA GGAGCTAGCG GAGATCGTGG TGCGCAAGGT
GCCATGGGTA ACCAAGGCGT TGCGGGCATC AAAGGCTCTC AAGGTGCGCA AGGACCCACA
GGTTATACCG GTGGTGCGGG TGTACAAGGC CCCGCTGGCT ATAATGGCCC ACAAGGTCCT
CAAGGAAACC CAGGTTTGAC TGGTTACCCG GGGACGCCAG GTAGTGACGG GCCACAGGGT
GCCACTGGGC CTAAAGGTAC TGACATTAAG GGTTATGCGG GGACTCCTGG TGCCATTGGC
CCTCAGGGCC CAAGAGGAAG ACAGTTTTCA GTAGCTGAAT GTCGCATGGG GGCTTTTTCT
TTTGATAATA AGACATTATC ACAATCTAAG GATTTCACGG CATTTACATC AGGTATTAAT
CTTGCCTTAA AAGGTGAACC CAATATTGGA CCTGCTCCAG GGAAACCTGA ATTAGATGGT
GTCATAGATA CTGAATATAT TCTATCTCGT GCCATGCGGG CTAATAAGGC TGGGATTATA
TATATTAATT TTCATGTATT AGCGATTGAA ACTAAAGAAG AAGAAGAAGC TTTCTTTTCG
GCGTTAGATA CAAGCGACAA AGCTGTGCAT GATTTTATTG TGGATTTATA CAGCCAAAAA
GAACTCGACA ACAGATATGC AGCTGAAATA GACAACATAA TGGCGGCGCC CACTCCACTT
CGATTTGAAG GAAGGACTCA GTAA
 
Protein sequence
MKNKIALAMG LIVVGISPPL FADSMKTSYS FNQANGYPGN GEQNSNGNKG YFPLSGNGQF 
ITAGRSGLAL AQDVMVVGGL APFNKDTPLT VSLWIKPTQD AAGGTILSRK HSENGQGFSV
AFDFDMNLEF TLVDEFGGSI SVATLTPVNV DQMWHHVAVS YKGDANASNM ALYIDGQVTE
LKLSSNSLTG QVETYHPLVI GGSSSYTQAL AAEIDEVYLV PQNFNAEQVT CLYQLKTDCA
YRPTTGKEGP RGPIGELGEQ GDRGATGVSG LAGDVGLKGN AGLPGPVGPQ GPKGPQGFTG
PTGLAGIDGS DGIDGSDGTN GAPGAQGAVG DSGIQGPQGS KGLQGNVGPK GASGDRGAQG
AMGNQGVAGI KGSQGAQGPT GYTGGAGVQG PAGYNGPQGP QGNPGLTGYP GTPGSDGPQG
ATGPKGTDIK GYAGTPGAIG PQGPRGRQFS VAECRMGAFS FDNKTLSQSK DFTAFTSGIN
LALKGEPNIG PAPGKPELDG VIDTEYILSR AMRANKAGII YINFHVLAIE TKEEEEAFFS
ALDTSDKAVH DFIVDLYSQK ELDNRYAAEI DNIMAAPTPL RFEGRTQ
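
The sequences above are printed in space-separated groups of ten, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus two basic checks (GC fraction and a rough CDS sanity test). All function names are hypothetical. The start-codon check is a simplification: translation table 11 also permits alternative start codons, which this sketch does not handle.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups across lines."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str, stop_codons=("TAA", "TAG", "TGA")) -> bool:
    """Rough check: whole codons, ATG start (simplification), terminal stop."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stop_codons


# First line of the gene sequence as printed in the record.
first_line = "ATGAAAAATA AAATAGCACT CGCTATGGGA TTAATAGTTG TGGGTATAAG CCCACCTCTA "
print(len(clean_sequence(first_line)))
```

To reproduce the GC content reported in the record, pass the full gene sequence block through `clean_sequence` and then `gc_fraction`.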