Gene CA2559_04165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCA2559_04165 
Symbol 
ID9296322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCroceibacter atlanticus HTCC2559 
KingdomBacteria 
Replicon accessionNC_014230 
Strand
Start bp950515 
End bp953730 
Gene Length3216 bp 
Protein Length1071 aa 
Translation table11 
GC content35% 
IMG OID 
Productouter membrane protein 
Protein accessionYP_003715597 
Protein GI298207418 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAAC TTTTCATTGG CTTTGCGTTA TTGCTTTCTG CAATGAGCTT TGCCCAAACT 
ACTATTACCG GAACTGTTAC AACCACAGCT TCTGGAGAAA CAGTGCCATT TGTAAATGTT
ATTTTAAATA ATAGCACGAC AGCTACAACT ACAGATGATA ATGGCCGCTA CAGTATCGAC
ATAAATTCAG AATTAGATGT CCTTAAATTT TCTGCTTTAG GCTTTATTTC TCAATCTATA
ACAGTTGGTA ACAAGACCAT AATAAATGTA GCATTAGAAG AATCTACAAC AGACCTAAAT
GAAATTGTAA TTACGGCTTT AGGATTTAAA CGAGAAACCA AAGAGTTAGG ATATGCCGTA
CAAAGTTTAG GTAGTGATGA TATTCAAGAA GTTAAAGCTG TAAATTTTCT AGATAATTTA
AGCGGAAAAT TAGCAGGTGT TACTATAAGT CAAGGAGCAA CAGGTGTTGG TTCTACTTCA
AAAATAACAA TACGAGGTGA GGCTTCATTT TCTAATAACA ATCCCTTATT TGTTGTAGAC
GGTACACCTA TTAATAATAA TACGGTTTTT AATTTCACAA ATGAAGCAGC AGCAGGTTTT
CAAGAAGTAG ATTTTGGTAA TGGTGCTATG GAGGTTAATC CAGATGATAT CGCTTCGGTT
TCAGTATTAA AAGGACCAAG TGCAGCAGCA CTTTATGGTA CTAGAGCATC TAACGGCGTT
ATTGTTATAG AAACTAAGAA TGGAGCAAAC AAAAAAGGCT TAGGAGTAAG TTATAATACC
AGCCTTTTTA TAGATACTGC ATTTAGGTTG CCAGATTTTC AAAATGAGTA CGGCCAAGGA
AATTCTGGAG AGTTTGAATA TGTAGATGGC TTAGGTGGCG GTATAAACGA TAATATAACA
TATTCTTGGG GGCCAAGGTT AGACCAAGGC TTACTCATAC CACAGTTTGA CAGTCCTGTT
GTACTTGCTA ATGGCACAAT TGTAAGAGGT GGCGATACTT CTGTTTATGA TGGCCAACCA
ATAACTCCTA CAGCATTTAA TTCAAATCCT GATAACCTTA AGGATTTTTA TGAGACAGGT
GTAACAACTA TTAACAACCT ATCAATAGCT ACTGGTTTTA ATACTGGAGA TTTTAGGTTA
TCTCTTACAG ACTTAAGAAG TGATGGTATT ATTCCTGGCG TTAACTTAGA CAGACAAACC
ATATCTACAA AGCTAAATTT TACACCAACT CAAAAAACAA AAATCACGTC TAACATAAGT
TATGTAAACT CTCAAAGTGA TAATAGACCA TCAAATGGAT ATGGTAGTGA AAATGTAAAT
TACTCTTTAG TAGCTTGGGG ACCAAGATCT TTAAATATAG ATAGCTTAAG AGATTACTGG
CAACCTGGTT TAGAAGGAGT ACAACAATAC TCTTTTAACT ACACCTTTTT TGACAATCCA
TATTTTATCC TTTTTGAAAA TAGAAATAGT TTTAATAGAG ATCGTGTTTT TGGAAATGTT
TCTATCAAAC ATAATTTTAC AGAAAAGCTA AGTGTTGCAG TAAGGTCTGG AATGGATTAC
AGTAATGAAA AACGTCAATT CTTAAGAAAT TTTAGTTCAA ATAGATTTAA AAATGGTGCG
TACGCAGAGC ACGATGTTTT TTTTAGAGAA ATAAATACAG ATATCTTAGT AAACTATCAA
GATATTGTTG GAGACCTTTC TTTTGATGTG TCTTTAGGAG GAAACAGATT AGATCAAAAT
GCATCTACCA AACAATCTCA AGCAACTAAT TTAGCACAAC CTGGTATATT TAGTCTTAAC
AATGCAGCTT CTCCTATTGA AGTATTTCAG TTTGAGTCTC AAAAGCGAAT CAATTCTATT
TATGGCTTGG CAAAATTTGG GTATAAAGAC TACCTATTTT TAGACATTAC AGGAAGAAAC
GATTGGTCTA GCGCATTAGC AACTCCTTTT TCGGTAGATG GTACATCTTT CTTTTACCCT
TCAGCTTCAG CAAGTTTTAT ATTGTCTGAG GTAGCAACGC TTCCAAGCAT ATTTTCTTAC
GCTCAGCTAA GAGCAAGCAT AGCACAAGTA GGTAATGATA CTAACCCATA CCAAACCTCA
GGAACGTTTG TCTCTCAAAC CCCATTTAAT AGTCAACCTA CATTTAGCAA TCAGGATTTA
ATTCCAAATG CAAACCTAAA GCCAGAAAGC ACCACATCTT ATGAGGCAGG TTTTGATGTT
CGTTTTTGGA GAGATCGTCT TAATTTAGAT TTTACGTATT ATAATGCATT AACTAAAGAT
CAAATAATAT CATTACCAAT TGGTATATCC TCAGGATATA ACCAACAGGT AGTTAACGGA
GGAAAAGTTC GAACAGAAGG TGTTGAAATT ATTGCAGGCT TAATACCAAT CATAACAGAT
AAATTTAGGT GGACAACCAC ATTTAATTTT AGTAAAAGTG TTGCAACTGT TGAAGACTTA
CCACAAGATG ATGGTCGCCT AACTTTAGGC TTAAGTAGAA TTTATGACAG TGCTAACCAA
ACGGTTTTCT TTCAAGTTGA AGAAGGTGGT CGCGTAGGTG ATTTTTATGG AACAGGCTAT
CTTAAAAACG AAAACGGCGA TTTTATCCTA ACCGATGACG GAAGATATAT TGCAGATAAT
AATTTACAGA AATTTGGAAA CTATAATCCA GATTTTATGC TAGGATGGAA CAACCAATTC
TCCTACGGTA ACTGGAATTT GAGCTTCCTT TTTGATTGGA GACAAGGTGG CGAAATTGTA
TCTAGAACTA GAGCTTTAGG TAATGTAGGT GGACAATTGG CAGAAACTGC TTTTAGACCT
GAAGGCGGCA TTATAGCTCA AGGTGTTGTA AATACGGGTA CTGCAGAAAA TCCTAACTAT
ATTCCTAATA CAACTGCAGT AACTGCAGAA AGCTACTACC GTCAATTTTA TGATAGAAAC
CACGAAGAAA ATAATATATA CGATGCCTCT TATTTAAAGC TAAGACAGTT TTCTGTGGGT
TATACCTTTA AATTAAATGA TGGATTTATA GGACTTAAGG AAGGTGTAGA TGTAAACCTT
TCTTTAGTTG GAAGAAACTT ATTTGCTATT ACAGAAAACC CGCATTTTGA TCCAGAGCAA
TTAGCCGTAC AAGGACAAAG TTTTGTAAGT GGTGTAGAAG ATATGAGTTA TGCCACAACA
AGAAGTATAG GTTTTAAAGC CGGATTTAAT TTCTAA
 
Protein sequence
MKQLFIGFAL LLSAMSFAQT TITGTVTTTA SGETVPFVNV ILNNSTTATT TDDNGRYSID 
INSELDVLKF SALGFISQSI TVGNKTIINV ALEESTTDLN EIVITALGFK RETKELGYAV
QSLGSDDIQE VKAVNFLDNL SGKLAGVTIS QGATGVGSTS KITIRGEASF SNNNPLFVVD
GTPINNNTVF NFTNEAAAGF QEVDFGNGAM EVNPDDIASV SVLKGPSAAA LYGTRASNGV
IVIETKNGAN KKGLGVSYNT SLFIDTAFRL PDFQNEYGQG NSGEFEYVDG LGGGINDNIT
YSWGPRLDQG LLIPQFDSPV VLANGTIVRG GDTSVYDGQP ITPTAFNSNP DNLKDFYETG
VTTINNLSIA TGFNTGDFRL SLTDLRSDGI IPGVNLDRQT ISTKLNFTPT QKTKITSNIS
YVNSQSDNRP SNGYGSENVN YSLVAWGPRS LNIDSLRDYW QPGLEGVQQY SFNYTFFDNP
YFILFENRNS FNRDRVFGNV SIKHNFTEKL SVAVRSGMDY SNEKRQFLRN FSSNRFKNGA
YAEHDVFFRE INTDILVNYQ DIVGDLSFDV SLGGNRLDQN ASTKQSQATN LAQPGIFSLN
NAASPIEVFQ FESQKRINSI YGLAKFGYKD YLFLDITGRN DWSSALATPF SVDGTSFFYP
SASASFILSE VATLPSIFSY AQLRASIAQV GNDTNPYQTS GTFVSQTPFN SQPTFSNQDL
IPNANLKPES TTSYEAGFDV RFWRDRLNLD FTYYNALTKD QIISLPIGIS SGYNQQVVNG
GKVRTEGVEI IAGLIPIITD KFRWTTTFNF SKSVATVEDL PQDDGRLTLG LSRIYDSANQ
TVFFQVEEGG RVGDFYGTGY LKNENGDFIL TDDGRYIADN NLQKFGNYNP DFMLGWNNQF
SYGNWNLSFL FDWRQGGEIV SRTRALGNVG GQLAETAFRP EGGIIAQGVV NTGTAENPNY
IPNTTAVTAE SYYRQFYDRN HEENNIYDAS YLKLRQFSVG YTFKLNDGFI GLKEGVDVNL
SLVGRNLFAI TENPHFDPEQ LAVQGQSFVS GVEDMSYATT RSIGFKAGFN F