Gene Information |
Locus tag | Paes_2008 |
Symbol | |
ID | 6459833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prosthecochloris aestuarii DSM 271 |
Kingdom | Bacteria |
Replicon accession | NC_011059 |
Strand | + |
Start bp | 2205183 |
End bp | 2206526 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642725991 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002016665 |
Protein GI | 194334805 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
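The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon of this non-spliced CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2205183
end_bp = 2206526

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
# The last codon is assumed to be the stop codon, which is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, protein_length)  # 1344 447
```

Both results agree with the Gene Length (1344 bp) and Protein Length (447 aa) fields.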
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000403932 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.229254 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTCAG ATATTTTCGA ACTTTTTATT CTTCTTTGCC TCATACTGGC AAACGGCTTT TTCTCCATGG CGGAATTCGC CATTATCTCA TCAAGGGAGA CCAAATTACA TGAATTGCAC GAAGCCGGTG TTTCGAGAGC GGGCCTTGTC CTCGAACTGC TTGATAACCC CGGAAAATTT CTTTCGGCCA TTCAGGTAGG GATTACCCTG ATCGCAACAC TCGCAGGGGC ATTCAGCGGC ATCACTCTGT CTGCGCCCAT AGCGGAAATG ATCGAGCGTG CAGACGCGCT CAAACCATAC AGCAATGAGC TTGCTCTTGG CCTCGTGGTT ATAGGCGTCA CCTACTTCAC CCTGATTATC GGTGAACTTG CCCCGAAAAA AATAGCCCTG CAACACCCTG AAAAAATCGC ATTGTCTGTT GCAAAAATCA TAGACATCAT CTGCAGGGTC ATTGCGCCGA TCGTACACCT GATCAACGGA TCAACCAACA TCGTACTGAA AATCATGGGC ATCAAACCAA CCGAAAAGCC CACGGTAAGC GACGAAGAAG TGATGCTGCT GCTCAAGCAG GGAGCAAAAA AAGGGGTGTT TGAATCGGTC GAATACGATA TGGTTTCACG AATTTTCCGG ATGAGCGACA AACGGGCAAA TTCGATGATG ACCCCCAAGA GCGAAATAGA GTGGCTGGAT CTTTATGCCA CCGAAGAAGA GCTCATTTCG AAAATGCAGG CCAGTGGCCG ATCGAGATTT CCTGTCTCAG AAGGCAGTCT CGATAACCTG AAGGGAGTCG TTCGCTCGCT CGATCTGGTC AACAAGCAGC TCCTGAGCCA GGGCAATCTG AAGGATGCCA TCCGCAATGC GATGAAAGCC CCGCTCTTTG TTCCTGAATC GATCCCTGCG TTTCAGGTTC TCGAACTTTT CAAGGAAAAC CGGGCTCACC TTGCACTGGT TGTCGATGAA CAGGGTTCGG TGCAGGGAGG AATAACAATC ACCGATGTCC TTGAAAGCAT TGTAGGCGAT ATTCCGGCCG ATGACATCGA AGGAAACCGC AAAATCGTAC GCCGGAGTCA GCGGACATGG ATCATTGACG GACTGCTGCC GGTCGATGAT TTCATTCAGG AATTCCATCT TGAAAACTTT CTGGATGAAG ACAATCCGCT CTATGATACC ATGGGGGGGT TCATGATGAC GAAACTTGAA AAAGTCCCTT CTGTCATGGA TATACTCGAA TGGCAGGGGA TACTCTTCAA AGTCATTAAA ATGAATAAAC AGCGGGTAGA CAAAATCCTG GCTGTTTTCA ATAACGACGC CCACGATAAA GCGTCAAAAT ACGATACGAA ATGA
|
Protein sequence | MDSDIFELFI LLCLILANGF FSMAEFAIIS SRETKLHELH EAGVSRAGLV LELLDNPGKF LSAIQVGITL IATLAGAFSG ITLSAPIAEM IERADALKPY SNELALGLVV IGVTYFTLII GELAPKKIAL QHPEKIALSV AKIIDIICRV IAPIVHLING STNIVLKIMG IKPTEKPTVS DEEVMLLLKQ GAKKGVFESV EYDMVSRIFR MSDKRANSMM TPKSEIEWLD LYATEEELIS KMQASGRSRF PVSEGSLDNL KGVVRSLDLV NKQLLSQGNL KDAIRNAMKA PLFVPESIPA FQVLELFKEN RAHLALVVDE QGSVQGGITI TDVLESIVGD IPADDIEGNR KIVRRSQRTW IIDGLLPVDD FIQEFHLENF LDEDNPLYDT MGGFMMTKLE KVPSVMDILE WQGILFKVIK MNKQRVDKIL AVFNNDAHDK ASKYDTK
|
| |
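The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched as follows. This is an illustrative example, not the database's own pipeline: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is built by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; since this gene begins with ATG, that difference does not come into play here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. The amino acid assignments
# are those of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon.

    Codons containing characters other than T, C, A, G become 'X'.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 24 bases of the gene sequence, copied from the record.
first_30 = "ATGGACTCAG ATATTTTCGA ACTTTTTATT"
last_24 = "GCGTCAAAAT ACGATACGAA ATGA"

print(translate(first_30))  # MDSDIFELFI
print(translate(last_24))   # ASKYDTK
```

The two fragments reproduce the first ten and last seven residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` and multiplying by 100 gives a value that can be compared with the GC content field.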