Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CA2559_08751 |
Symbol | |
ID | 9297234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Croceibacter atlanticus HTCC2559 |
Kingdom | Bacteria |
Replicon accession | NC_014230 |
Strand | + |
Start bp | 1915723 |
End bp | 1918728 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | |
Product | putative protein-export transmembrane SecDF protein |
Protein accession | YP_003716497 |
Protein GI | 298208318 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.392094 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGAACA AAGGATTAAT TAGAGTATTT GCCATTTTGT TTGGGTTGGT CTCTCTGTAC CAACTTTCAT TTACTTTTTT TACTAACAAG ACAGAAAGCG ATGCAGAAGC ATTTGCAAAG CAACAAATTA GCGAGAGTAC AGAAAACTAC TCTGCTCTTC GTGAAGAGGT AGAAACTCGT TATCTAGATT CTATAGGAAA CGAAGAAGGT TTCTTAGGGA TGACTTACAA TGAATCTAAG GATAAAGAAC TTAATAAAGG TCTTGACCTT AAAGGTGGTA TAAACGTTAT TCTTCAAGTT TCTGTAAAAG ATATCTTAAG AGGTTTATCT AATAACTCTA AAGATCCAGC ATTTAACCAA GCCTTGGCTA ATGCAGACGA AGCTCAAAAA GATGCTCAAG ACGATTATAT CGATTTATTT CTTGAAGCAT TTAATGACAT TCCAGATGCA AGATTAGCGT CTCCAGATAT CTTTGCTAAT AAAACACTTA GCGACGAGAT TAACTTCGAG ATGTCTAATG CTGAAGTTGC AACAGTATTA AAGAAAAAAA TTGATGAAAG CATTGTTTCT GCTTTCGAGG TATTACGTAA GCGTATAGAT AAGTTTGGGG TTACACAACC TAACATTCAA CGTTTAGGAG AATCTGGACG TATCTTGGTA GAATTACCAG GAGCTAAAGA TGTAGACCGT ATAAGAACGC TATTACAAAG TACAGCTCAA TTGGAATTTT GGGATGGATT AGTAGGCTCT AATTTTGGTC AGTTTTTAGT TGATGCAGAT GCTTACATAA AAGAAACTCA ATCAACTTCT ACTGAAATTG AAAATGCAGA GCAAGTAACA GATTCTACTG AAACTGCAGC AGGAGATGAT TTAGAAGATT TATTAGCTGA AGAAAACGAC TCTACAGAAG TTGCTTCAGG TAATAACCCG TTATTAGAAC TTATAGTAGC ACCAGGTTAC CAAGGTGGTC CTGTAATAGC TGTCTTTAAC GCTAAGGATA CTGCTAAAGT TAATAGCTAT TTAAAGAAAC CTCAAGTAAA AGCTTTATTA CCAGCAGAAC AGAAATACAC TAAATTTCTT TGGGGAATAG AAGATGAGAA TGGTTTAGTG CAGTTATACG CTGTAAAAGG AAACCGTGAT AATGAGCCAG AACTAAGTGG AGGTGTTGTA ACAGATGCTG CACAGGTATA TAACCAAGCT GGTCAAGTAG CAGTTTCTAT GCAAATGAAT GGAAAAGGAG CTAAGATCTG GGAAGAAATG ACTGGTCGTG CTTACCAACA ACAATCTACA ATAGCAATTG TGCTTGATGA TGTGGTTTAC TCTGCGCCAT CATCTACAAG TGGTGCAATT TCTGGAGGTC GTACAGAGAT TTCAGGAAGC TTTACTGTTG CAGAAGGACA AGATTTAGCA AATGTTTTAC GAGCAGGTAA GTTACCAGCA TCAGCTGATA TTATACAGAG TGAAGTAGTA GGACCATCAT TAGGGCAAGA AGCAATTGAT AGTGGATTAA TGTCATTTGT GATAGCATTG TTATTTGTAT TAGTTTGGAT GATATTCTAT TATGGTCGTG CAGGAGCTTA TGCTGATGTT GCTTTAGTTG TAAACATTTT ATTCATATTT GGAATTTTAG CAGGATTAGG AGCTGTGTTA ACTTTACCAG GAATTGCAGG TATAGTATTA ACAATAGGTA TTTCGGTTGA TGCCAACGTA CTTATTTTCG AACGTATTAG AGAAGAGTTA GCAAAAGGAA AATCTCAGAA AGACTCTATT AAAGATGGTT TCAATAATGC ACTATCTTCA ATTTTAGATG CAAACATTAC AACAGGTCTA ACAGGTCTTA TATTATTAGT ATTTGGTACG GGACCAATTA AAGGTTTTGC AACAACGTTA TTAATAGGTA TTGCAACATC ATTATTTACT GCAATTTTCA TTACAAGATT ATTTATTGAC GGTTATGGTA AAAATGGAAA ATCATTAGCT TTTTCAACAC CAATCACTAA AAAATGGTTC CAGAACGTAA ATGTAAACTT CCTTAAAAAA CGTAAAATTG CTTATGTGAT TTCTGCAGCA ATTATAGCTG TAGGCTTAGG ATCTTTATTT ACACAAGGAT TAGATCAAGG TGTTGATTTT GTTGGAGGAC GAACTTACAC GGTACGTTTT GATAAAGATG TAGATGCTAA CGAAATTGAG CAAGACTTAA TTGCAACTTT TGAAAGTGCA GAGGCTAAAA CATTAGGAGC AAATAATCAA TTAAAGATTA CAACTAAGTA CAAGGTAGAA GAAACAAGTA CAGAAGTTGA TAATGAAATA CAGGAAATGC TTTATAACTC GCTAAAGCAA AACCTTCCTC AAGGCATGAC TTATGAAGAC TTTAAGCAAG AATCTTCAGA TAAGGAAGCT GGTTTAATGA GTAATTACAA AGTAAGCCCT ACAATTGCAG ATGATATTAA ACAGGCTTCT GTTTGGGCTG TTTTAGGATC ACTGTTAGTA GTATTCCTTT ATATCTTATT ACGTTTTAGA AGATGGCAGT TCTCTTTAGG AGCTGTTGCT GCTGTATTCC ACGATGTATT AATTGTATTG GGTATATTCT CATTAACATA CAAGTTTATG CCTTTTAATA TGGAAATAGA TCAGGCATTT ATTGCTGCCA TACTTACAGT AATAGGTTAC TCTTTAAATG ATACGGTAGT TGTCTTTGAT AGAATACGTG AGTATTTTAA TGAAAATGAA CAATGGCCAA TGTCTAAGAT TATAAATAGT GCATTAAGTA GTACCTTGAG TAGAACGTTA AATACCTCTT TAACAACGTT AATAGTGTTA CTAGCAATAT TTATTTTCGC AGCACCACTT AGAGGCTTTA TGTTCTCATT AATAATAGGT GTAGTAGTTG GTACTTATTC TTCATTATTT ATTGCAACTC CAGTAATGTT TGATACAGTT AAAAAACGTG GTATTGATCT TAAGTATAAA GAGAAGGAAG AACTTGAAAA TGATGCTACA GTTTAA
|
Protein sequence | MQNKGLIRVF AILFGLVSLY QLSFTFFTNK TESDAEAFAK QQISESTENY SALREEVETR YLDSIGNEEG FLGMTYNESK DKELNKGLDL KGGINVILQV SVKDILRGLS NNSKDPAFNQ ALANADEAQK DAQDDYIDLF LEAFNDIPDA RLASPDIFAN KTLSDEINFE MSNAEVATVL KKKIDESIVS AFEVLRKRID KFGVTQPNIQ RLGESGRILV ELPGAKDVDR IRTLLQSTAQ LEFWDGLVGS NFGQFLVDAD AYIKETQSTS TEIENAEQVT DSTETAAGDD LEDLLAEEND STEVASGNNP LLELIVAPGY QGGPVIAVFN AKDTAKVNSY LKKPQVKALL PAEQKYTKFL WGIEDENGLV QLYAVKGNRD NEPELSGGVV TDAAQVYNQA GQVAVSMQMN GKGAKIWEEM TGRAYQQQST IAIVLDDVVY SAPSSTSGAI SGGRTEISGS FTVAEGQDLA NVLRAGKLPA SADIIQSEVV GPSLGQEAID SGLMSFVIAL LFVLVWMIFY YGRAGAYADV ALVVNILFIF GILAGLGAVL TLPGIAGIVL TIGISVDANV LIFERIREEL AKGKSQKDSI KDGFNNALSS ILDANITTGL TGLILLVFGT GPIKGFATTL LIGIATSLFT AIFITRLFID GYGKNGKSLA FSTPITKKWF QNVNVNFLKK RKIAYVISAA IIAVGLGSLF TQGLDQGVDF VGGRTYTVRF DKDVDANEIE QDLIATFESA EAKTLGANNQ LKITTKYKVE ETSTEVDNEI QEMLYNSLKQ NLPQGMTYED FKQESSDKEA GLMSNYKVSP TIADDIKQAS VWAVLGSLLV VFLYILLRFR RWQFSLGAVA AVFHDVLIVL GIFSLTYKFM PFNMEIDQAF IAAILTVIGY SLNDTVVVFD RIREYFNENE QWPMSKIINS ALSSTLSRTL NTSLTTLIVL LAIFIFAAPL RGFMFSLIIG VVVGTYSSLF IATPVMFDTV KKRGIDLKYK EKEELENDAT V
|
| |