Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CA2559_10733 |
Symbol | |
ID | 9297635 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Croceibacter atlanticus HTCC2559 |
Kingdom | Bacteria |
Replicon accession | NC_014230 |
Strand | - |
Start bp | 2340080 |
End bp | 2342041 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | |
Product | capsular polysaccharide biosynthesis protein CapD |
Protein accession | YP_003716890 |
Protein GI | 298208711 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTATA GTGAAAATAA GATTAATGTT CGTAATATTA AATATCTTCC AAGATGGGTT GTATTAATAA TTGATACATT ACTTATTTGT ATTTCTTTAA TTGTATCAAT AATAATACTT AATAGAATTA GAATAAGTCC TTTGGATGTT TTAGAATATT CCATTCAAGC ATCTATTGTA ATTTCTGTTA ATGTAATCTT CTTCTTTTTA TATAAAACAT ACTCAGGACT TATAAGACAC TCTTCAATGA TGGATGCATT AAAGCTGTTT TTAGCTTCAA TGAGTACCAC AGTTGTATGC CTTATGTTAA ATGTATTTTA TCAAGGCACT TATGGTGATA AGATTTTTAT TACTACCGGT CTTATAATGT ATTCATTTAT ATCATTTTCG ATTTTATTTT TATTTAGGAT AGCAATTAAA CAACTCTATG AGTTTTTTAA GATAGCACAA CGTGAAGAAG ACCTAATTGA TGTAGCAATT CTCGGAATAG ATGAAAATGC AATATCTATT GCTGCAGCAT TAGAAATGGA GCACCCTAAA CGTTTTCTAG TAAAGGGATT TGTGTCTAGA AATAACTTTA AAAGACAGTT AAGAATATTA GAAAAGCCTG TTGTTAGAGT AAAGGAGTCT ATTTTACCAT CATTAAGAAT ATTAAATGTA AAGGCTATAA TAATTACTTC TGCTATAGAC CCAAAAGAAA AGCAGCAAAT AGTAGATGAG TGTTTAGAGA ATGACATTAA GATGTATAGT TCTCCAATCT TATCAAATTT TGAGGAAGGT AATTCTCCAC AAATACAAAA GCTGCAAATT GAGGATTTAT TAGAAAGAGA GCCTATTGTA TTAAATAAAA AAAATAAAGA ACATCAATTA AAAAATAGAA CTGTCTTAGT AACTGGTGGA GCAGGATCTA TAGGTAGTGA AATTGTTAGG CAAGTAGCAG AATACAAACC TAAGATGTTA TTAGTATTAG ATCAAGCAGA ATCTCCTTTG CATCAAATAC AACTGGAGAT AAATGAGCAC TTTCCTAACC TTAAGTACAA ATGTATAATT TGTGATGTTA CTAACAACGT TCGCTTGCAA AGTGTTTTTG ATGAGTTTGA TATTGATGTT ATTTATCACG CAGCGGCATA TAAACATGTG CCACTTATGG AGAACAACCC TCATGAAGCT ATTTTAACAA ACATTCTTGG AACAAAACAA GTTGCAGATT TAGCTGCAAA GTTTAAAGTA GGTCACTTTG TAATGGTATC TACAGATAAA GCAGTAAACC CTAGTAATGT TATGGGAGCT TCTAAAAGAG CTTCAGAAAT GTATGTGCAA TCTCTAAATT ATAACTTACA ACTGAAAGAT AGATGTGCTA CAAAATTTAT AACTACAAGA TTTGGTAATG TTTTAGGATC TAATGGATCT GTTGTCCCTT TATTTAAAAA ACAAATTGAA GAAGGCGGAC CAGTTACCAT CACACATCCA GATATAATTA GATATTTTAT GACAATCCCA GAAGCCTGTC AATTAGTTTT GGAAGCTGGT GCTATGGGTA AAGGTGGTGA GATATTTATT TTTGATATGG GTAAACCTGT CAAGATTATG GATTTAGCCA AGAAAATGAT AAAGTTGGCT GGATTTATAC CAAACAAGGA TATACATATT AAAATAACTG GATTAAGGCC AGGAGAAAAG TTATACGAAG AACTGCTATC AGATGAAGCT AAAACTCTAC CTACACATCA TGAAAAAATT ATGATTACTA AAGATAGAAA TGATAACTAT GAGTTTATCT CAAGCAGTAT AGATAGCGTT ATTAATTCTG CAAATAATCA TAATAATGTG CGTGTGGTTC ACAAGCTTAA AAAAATGATT CCAGAATTTA AAAGTAAGAA TTCATCTTAT GAGTCATTAG ATGCAGAGGT GTCGTTTAAC GAGATAATAA ATGAGGATAC CGTTAAAATA AAAAAGCTTT AA
|
Protein sequence | MSYSENKINV RNIKYLPRWV VLIIDTLLIC ISLIVSIIIL NRIRISPLDV LEYSIQASIV ISVNVIFFFL YKTYSGLIRH SSMMDALKLF LASMSTTVVC LMLNVFYQGT YGDKIFITTG LIMYSFISFS ILFLFRIAIK QLYEFFKIAQ REEDLIDVAI LGIDENAISI AAALEMEHPK RFLVKGFVSR NNFKRQLRIL EKPVVRVKES ILPSLRILNV KAIIITSAID PKEKQQIVDE CLENDIKMYS SPILSNFEEG NSPQIQKLQI EDLLEREPIV LNKKNKEHQL KNRTVLVTGG AGSIGSEIVR QVAEYKPKML LVLDQAESPL HQIQLEINEH FPNLKYKCII CDVTNNVRLQ SVFDEFDIDV IYHAAAYKHV PLMENNPHEA ILTNILGTKQ VADLAAKFKV GHFVMVSTDK AVNPSNVMGA SKRASEMYVQ SLNYNLQLKD RCATKFITTR FGNVLGSNGS VVPLFKKQIE EGGPVTITHP DIIRYFMTIP EACQLVLEAG AMGKGGEIFI FDMGKPVKIM DLAKKMIKLA GFIPNKDIHI KITGLRPGEK LYEELLSDEA KTLPTHHEKI MITKDRNDNY EFISSSIDSV INSANNHNNV RVVHKLKKMI PEFKSKNSSY ESLDAEVSFN EIINEDTVKI KKL
|
| |