Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CA2559_03670 |
Symbol | |
ID | 9296221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Croceibacter atlanticus HTCC2559 |
Kingdom | Bacteria |
Replicon accession | NC_014230 |
Strand | + |
Start bp | 835789 |
End bp | 838887 |
Gene Length | 3099 bp |
Protein Length | 1032 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003715498 |
Protein GI | 298207319 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTATT CAACCAAAAT TCTATTTCTC CTAACCACTT TATTCTGTTC ATTTTTTTCT GAAGCTCAAA CTGTCAACGA AGACCTATAT GACGCATTAG AGTATCGCTT AATTGGCCCT TTTCGCGGTG GTCGTAGTGC CGCAGTAACT GGCGTGCCAG ACCAACCTAA TCTGTTCTAT TTTGGGGCAA CTGGTGGTGG CGTTTGGAAA ACCTTAAATG GAGGTCGCAC TTGGGAAAAT ATTTCTGATG GCTACTTTGG TGGCTCTGTC GGTTCTATTT CTGTTGCTGA AAGTGATAAA AATGTTATCT ACGTTGGTGG TGGCGAGAAG ACCGTTCGTG GTAATGTTTC CTCAGGTTAT GGTGTGTGGA AGAGTGTTGA TGCTGGTAAA ACTTGGAAAG CATCAGGATT AGATAAAAGC CGACATATCT CCAGAATTCG CATTCACCCT AAAAATCCAG ACATTGTTTA CGCTGCTGTT ATGGGTAACC TTTACAAAGG CACACAGGAA CGTGGCGTTT ATAAAAGTAC AGATGGTGGA AAAACCTGGA CTAAAAAACT TTTCGCCAAT GAAGATGCAG GTGCAGTCGA TTTAACCTTT GATCCAAACA ATCCACGAAT TTTATACGCA TCAACTTGGA ATATCCGCAG AACACCATAC AGTTTAAGTT CTGGTGGTGA TGGTTCTGCA CTTTGGAAAA GTACCGATGA AGGTGAAACG TGGACAGAAA TTTCAAAAAT GAAAGGTTTT CCTGAAGGTA CTTTAGGTAT TATTGGTGTG ACGGTTTCCC CAGTAAATTC AGAACGTGTT TGGGCTATTG TCGAGCATAA AGATAAAGGC GGCTTATACC GAAGTGAAGA TGGTGGCGAA TCGTGGAGCC AAGTGAATGA CGAGCGAAAA ATTCGTCAAC GTGCTTGGTA TTACACAAGA GTCTATGCCG ATACACAAGA TGAAGATGTG GTGTATGTGT TAAATGTGCG CTACCACAAA TCAGAAAACG GCGGAAAATC TTTTGAAACC TACAACGCGC CGCACGGCGA CCATCACGAT TTATGGATTG CGCCAAATGA CCCAACTCGA ATGATTATTG GTGATGATGG TGGCGCACAA GTGACTTATG ATGGTGGCGA AACTTGGAGC ACCTATCACA ATCAGCCAAC GTCGCAGTTT TATCGTGTGA CGACAGACAA TTCATTTCCA TATCGCATTT ATGCTGCACA ACAAGATAAC AGCACAGTAA GAATTCCTCA CAGGACTGAA GGCCGCTCGA TTTCAGAGGA CGACTGGGAG TCTACTGCTG GTGGTGAAAG TGCTCATATT GCAGTTGACC CAGAAAATAA TGACATTGTT TACGGCGGAA GTTATGACGG TTTTCTAACA AGAGTGAATC ACGACAAAAA CACGGTGCGA AGCATTAGTG TTTGGCCAGA CAATCCTATG GGACACGGCG CTGAGGATAT GAAATATCGC TTTCAATGGA ATTTCCCTAT TGAATTCAGC AAGCATAATC CTGATAGGTT GTATACCTTT TCAAATCGCG TACACGTTAC TGAAGATGAA GGGCAAAGCT GGAAAGTAAT TTCGCCAGAT TTAACCCGAA ACGACCCAGA AAAGTTGAAA TCTTCTGGAG GTCCAATTAC GCAAGACAAC ACATCTGTAG AATATTATTG TACGATTTTC GCTGCTCAAG AAAGCCCGTT AAAGGAAGGC TTGCTTTGGG TTGGTAGTGA TGATGGTTTG GTGCACGTGA CCAGAAATGG TGGAGAGACT TGGGATAACG TGACGCCGAA GAATATGCCA GAATGGACAA TGATTAACAG TATTGAACCA AGTGCATTTG ATGAAGGTAC GTGCTATGTT GCAGGAACGC GATACAAATG GGGAGATTTT CAGCCTTACT TATATAAGAC TACAGATTAC GGGAAGTCTT GGACTAAGAT AACAAACGGT ATCAACGAAG AGCATTTTAC AAGAGTACTT CGTGAAGACC CGAAACAAAA GGGATTACTA TATGCAGGAA CAGAAACAGG AATGTACATT TCGTTTAACG ATGGTAAAAA CTGGAACCCA TTTCAGTTGA ATTTACCAAT AGTTCCTATT ACAGATTTAA CCATAAAAGA CAACAATCTA ATTGTCGCAA CACAAGGTCG AGGCTTATGG ATAATTGATG ATTTGAGTGT TATTCATCAG GCGATGAAAA TGAGCAATAA AGATGTTGCT TTAATGAAGC CGAAGCCAAC GTACAGAATG CAAGGTGGAA GTCGTGAAGG CTCCTTGACT TCAGGAACAA ATCATCCAAG TGGTGTGATG ACGTATTTCT ATCTGAAGAA TTATGATGAG AAAAAGGACA CGATTTCAGT GACCTATTTA AATAAACAGA ATGACACGTT GAAGAGTTTC AGCAACCATT CAAAGAAAGA TAAGTTAGAC GTAGAACAAG GTGCGAATCT GACAACTTGG GATACACGTA GCAAAGGTGC TGAGGTTTTA GACGGAATGA TTTTATGGTG GGCAAATCTA GATGCGCCAA GAGCCTATCC AGATACTTAC AAAGTTAGAT TGAATGTTAA CGGAAAAGAT GAAGAACAAT CGTTTGAAAT CATTCCAAAC CCAAACAGTG AGTCTACTGC TGCAGATATG AAAGCGCAGT ATGAGTTTAT TAGCGATGTT AATGAAACGG TAGATAAAGC GCACAAATCC ATAAAGAATA TTCGAGCGAT TAATAAGCAA TTGAAAGATT TTCAAGAGCA GTATAAGGAT GATGAACGCA CAAAAGAGTT GAGAGAAAAA GCTAAAAAAC TTCAAGACGA TTTTACCGCT ATTGAAGAAG CGTTATACCA AACCAAAAAC CGAAGCGGAC AAGACCCTTT AAATTTTCCA ATCAAGTTGA CGAACAAATT AGGACACTTA AACAGCTTAG TAGGAATGGG TGATTTTGCA CCAACAGAGC AAGACAAAGC CGTAAAACGT GAACTAGAAC AACAAATAAA TGCAGAGTTG GTGAAGTTTG ACAATTTGGT GACGCAAGAG ATTTCAGAAT TCAATACACA GTTTAATAAT TTGAAGTTGA ATTATTTATT TGTTGAGGAT AAGGAGTAA
|
Protein sequence | MNYSTKILFL LTTLFCSFFS EAQTVNEDLY DALEYRLIGP FRGGRSAAVT GVPDQPNLFY FGATGGGVWK TLNGGRTWEN ISDGYFGGSV GSISVAESDK NVIYVGGGEK TVRGNVSSGY GVWKSVDAGK TWKASGLDKS RHISRIRIHP KNPDIVYAAV MGNLYKGTQE RGVYKSTDGG KTWTKKLFAN EDAGAVDLTF DPNNPRILYA STWNIRRTPY SLSSGGDGSA LWKSTDEGET WTEISKMKGF PEGTLGIIGV TVSPVNSERV WAIVEHKDKG GLYRSEDGGE SWSQVNDERK IRQRAWYYTR VYADTQDEDV VYVLNVRYHK SENGGKSFET YNAPHGDHHD LWIAPNDPTR MIIGDDGGAQ VTYDGGETWS TYHNQPTSQF YRVTTDNSFP YRIYAAQQDN STVRIPHRTE GRSISEDDWE STAGGESAHI AVDPENNDIV YGGSYDGFLT RVNHDKNTVR SISVWPDNPM GHGAEDMKYR FQWNFPIEFS KHNPDRLYTF SNRVHVTEDE GQSWKVISPD LTRNDPEKLK SSGGPITQDN TSVEYYCTIF AAQESPLKEG LLWVGSDDGL VHVTRNGGET WDNVTPKNMP EWTMINSIEP SAFDEGTCYV AGTRYKWGDF QPYLYKTTDY GKSWTKITNG INEEHFTRVL REDPKQKGLL YAGTETGMYI SFNDGKNWNP FQLNLPIVPI TDLTIKDNNL IVATQGRGLW IIDDLSVIHQ AMKMSNKDVA LMKPKPTYRM QGGSREGSLT SGTNHPSGVM TYFYLKNYDE KKDTISVTYL NKQNDTLKSF SNHSKKDKLD VEQGANLTTW DTRSKGAEVL DGMILWWANL DAPRAYPDTY KVRLNVNGKD EEQSFEIIPN PNSESTAADM KAQYEFISDV NETVDKAHKS IKNIRAINKQ LKDFQEQYKD DERTKELREK AKKLQDDFTA IEEALYQTKN RSGQDPLNFP IKLTNKLGHL NSLVGMGDFA PTEQDKAVKR ELEQQINAEL VKFDNLVTQE ISEFNTQFNN LKLNYLFVED KE
|
| |