Gene Information |
Locus tag | Rcas_0069 |
Symbol | |
ID | 5537528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 85876 |
End bp | 88785 |
Gene Length | 2910 bp |
Protein Length | 969 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640892235 |
Product | hypothetical protein |
Protein accession | YP_001430225 |
Protein GI | 156740096 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
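The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in Protein Length. Neither convention is stated in the record itself, and the function name `check_gene_record` is hypothetical.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the length fields of a CDS record.

    Assumptions (not stated in the record):
      - coordinates are 1-based and inclusive
      - the CDS includes exactly one terminal stop codon
    """
    # Span covered on the replicon, counting both end positions.
    span_bp = end_bp - start_bp + 1

    # A complete CDS should be a whole number of codons.
    codons, remainder = divmod(gene_length_bp, 3)

    return {
        "span_matches_gene_length": span_bp == gene_length_bp,
        "length_is_multiple_of_3": remainder == 0,
        # One codon is the stop codon, so it encodes no amino acid.
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information table for Rcas_0069.
result = check_gene_record(
    start_bp=85876,
    end_bp=88785,
    gene_length_bp=2910,
    protein_length_aa=969,
)
print(result)
```

For this record all three checks agree: 88785 - 85876 + 1 = 2910 bp, and 2910 / 3 = 970 codons, that is, 969 amino acids plus one stop codon.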
Plasmid Coverage Information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0314244 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0000262699 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGCAGG GAGCGATTCG CAACGGCGCC CCCTCGCCGC CTGCGACCCG TCCGCCGCGC CCCGACCTGC CATTGCACTT GGTGCGCGCA CTGGCGTTTG CCTGCATTGC CCTGGCGTTG ATGCGTCCGC TGGCGTGGCC GCTGATCGCG GCATTCCCCT TCGCGTCGCT GACGGTGTTG CGTCTGATGA ATGATCCGGC GCTGGCAGCG CTGGCGCTGT TGCATGTCGT GCTGGCGCCG CCAGTCTTTC CGGCGCTGGT CATTGGCGCC CTGGCAGCGC TGTGGGTGCT GTTGGGCGAA CTCGTCTTCG CCCTGGCAAC TGCGCTTCTT GCACGCTATC GGGCGCAGCG CGCGCTGCGT GCGCCGCTGT GTCTCCAGAT TCGTCCGACC GCTTCAGCGC GCACCGGAAC CCCTGCGAAA CCGGGTGCAC TCATGCGATT GATCCACGGG GCTACGGTGG CGCGTTCCTG GATGCACGCC GCACCCTGGT ATACTCTGCT GGTCAACGGG GCGCCCGACC TGCCTGCCGA ACTGGGGGCG TTGATCGCCG GTGCATCCGA GGAGCGCCCA CGGACCGTCA CCGCACTGGA TGGCGCAGTG CGCAGCAGCG TACCGGAAGC GCTGATTCAC GCCGCCGCCG ACCCGCTGCT GGCAGCGGCA ACGCCGGGGC GTTGGATTGC CTGGCAGCGC TTCGGGCTGG CGCTGCCGCC CGCCTATCCA CTTCACGCAC CAACCATCGC CATCGAGTCG GAACTGACCG GGGTGTTGCT GGCCGCAGTA CGTCCGCAGG CCAGCGTGGC GCACGCCGGT CTGGAAGTGG CGTTGCGCCC TCAGGTGGGT TGGGAGTTGG GTCGGCAGTG GCGCGCGCGC GCGACAACGC TGAAACTGGC GCTCGAACAG CGTCAGGACT ATGCGTTGTC GCCGGATGTG GCTGCAATCG AGGCAAAATT GGGCGATGCG GCGTTCGAGG CGACCATTGT GACAACTGCG GTCGCCGATC AGCGCGCTGA TGCCATTGCT GCGCTGCTCG CCATCGGCGA TGCGCTCGGC GCCTTCCAGC AACGCACCGC CAGTCGTGTG CAACGTCTCG TTCCGCACGG GCGCATCTCG GTGCGCCGGG TATCTGAAGG GAACGCTGCG GACACGATTA TCCGCCTGCG CACGCCACGC ATTGCGCCGC CCCCGGCGCT CCTGTTACCG TTCCGCCTCT GGCGCGGACC GGACGTGCTG ACAGCGGGGG AACTCGGTTA TCTCTGGAAT CCGTCCGCTC TGCCGGCGAG CGGACTGGTG CGCAGCGATC CGTGCCGTCG GATTGCGGCG CCGCCGCACG CCTTCTGCGG CGCCGACCCG GAACGCATTG TGGTCGGCTA TGCCTCCCAC GCTGATGGAC AACGCGCGCC GGTCGGACCG ACGCTGCGCG ATCTGCGCCA GATACTGCAC CTGACGGCCG GCATGGGCGC CGGGAAGAGC CGGTTGCTGG CGAACCTGTG CCGGCAACTC GTTCCGCGCG GATTTATGCT GATCGACGGC AAGGGGGACG ACCGGGACGG CAGTCTCGTG GCAGTGGTGC GTCGGCTCAT CCCGCCGGCG GACGACGCGC GACTGGTGCT GCTCGACCCG CTCGATACCG CCTGGCCCAT CGGGCTGAAC CCGCTGGCGG GCGTGGATGC CCGGCGTCCG GGAGGCGCCG ACCTGGCGCT GGGTCAACTC CTTGCAACGT TCGCGCGCAT CGATCCCGGC GCCTGGGAAC GGTCGCCGGG GATGCAGCAA TTCGCGCGCA TGGCAGCGTT GTTGGTGCTG 
GAAGGCGAAG CGCACCCCAC GCTGGCGCAC GTCAAACAGG CGCTCCTCGA CGAAGCGTAC CGGGAGGAAT TGCTCCAATC GACGCACAAC ATCGAAGTCG CCAGTTTCTG GCGCGAGACG TACCCGCGCC TGGGAGAAGG GCAACGATCC AGTTGCGATG CGCTGTTGCG GCGATTCGAC GCCCTGCTCA CCGCAGAAAC GACGCGCTAC CTGGTTGCGC AGGCACAACC GACGCTCGAT CTGGCGCGCA TGATGGCCGA CCGTATGATC GTGCTCGCGC CCTTGCCCGA TGTGACGCTG GGTGGTTTGG CGGGCGCAGT GGGAATGCTG ATCGCCCAGG CCTTCGTGCG TGCTGCCTTC AGTCGGGGCG GCGATGACCA GACTCGCCAC GACTATCCCC TGATCATCGA CGAATTGCAG GTGTTAATTG GCGCCGGCGA CACAACCGAC ATAGCGACTG CCATCACGCG CCTGCGGTCC CTGGGCATCC CGACGATCTA CGCCCATCAG GCATTGGCGC AGTTAGGCGA TCTGCGCGAC CTGATGCTGA TCAATGCCGG GAACCGCATT ATGCTGCAAA CCCAGGAGCC GGATGCCAGC GTGTATGCGC GCGCCTACGC CGCCAGCGGG CTGACCGCTG CCGACCTGAG TGGGCAACCG CCGAACGAGC ATCAGTACGC GGTGTTGCGC TGCGGCGGGC TGGTCGCAGG ACCATTTTCG ATGCAACCGC TGCCCTGGCC GACGGTGGAG GAGGAGGCGC CGCCGCCCTA CGTCGGACCG GCATGGCGCG ATGTTCTTCC CGATGATGGC GATCCGGCGG ATCGCTTTAT TGCACAGGTG ATCTACACGG CAGACGACAG CGCCGGATCT GCCCGCGAAC TGGCGCGGCT TGATGAGGCG GACTGGGAAC GACTGCTGCG GCGCTGGGAG TGCATCCGCG CGCAGCAGCG CCAGTACATT CTGGCGCATC CCGGTTGCAT TCCTGACCGG CGGGAGCGAC AACGCTGGCT CTCGCGGCTG TATGCAGCGC GTCCGCGAGT GCTGGCGGCT GCTGAGTACC TGCGCGGACG CCAAAAAGGA TCGCATCAGA AAGCATTAAG CAAGATTTGA
|
Protein sequence | MKQGAIRNGA PSPPATRPPR PDLPLHLVRA LAFACIALAL MRPLAWPLIA AFPFASLTVL RLMNDPALAA LALLHVVLAP PVFPALVIGA LAALWVLLGE LVFALATALL ARYRAQRALR APLCLQIRPT ASARTGTPAK PGALMRLIHG ATVARSWMHA APWYTLLVNG APDLPAELGA LIAGASEERP RTVTALDGAV RSSVPEALIH AAADPLLAAA TPGRWIAWQR FGLALPPAYP LHAPTIAIES ELTGVLLAAV RPQASVAHAG LEVALRPQVG WELGRQWRAR ATTLKLALEQ RQDYALSPDV AAIEAKLGDA AFEATIVTTA VADQRADAIA ALLAIGDALG AFQQRTASRV QRLVPHGRIS VRRVSEGNAA DTIIRLRTPR IAPPPALLLP FRLWRGPDVL TAGELGYLWN PSALPASGLV RSDPCRRIAA PPHAFCGADP ERIVVGYASH ADGQRAPVGP TLRDLRQILH LTAGMGAGKS RLLANLCRQL VPRGFMLIDG KGDDRDGSLV AVVRRLIPPA DDARLVLLDP LDTAWPIGLN PLAGVDARRP GGADLALGQL LATFARIDPG AWERSPGMQQ FARMAALLVL EGEAHPTLAH VKQALLDEAY REELLQSTHN IEVASFWRET YPRLGEGQRS SCDALLRRFD ALLTAETTRY LVAQAQPTLD LARMMADRMI VLAPLPDVTL GGLAGAVGML IAQAFVRAAF SRGGDDQTRH DYPLIIDELQ VLIGAGDTTD IATAITRLRS LGIPTIYAHQ ALAQLGDLRD LMLINAGNRI MLQTQEPDAS VYARAYAASG LTAADLSGQP PNEHQYAVLR CGGLVAGPFS MQPLPWPTVE EEAPPPYVGP AWRDVLPDDG DPADRFIAQV IYTADDSAGS ARELARLDEA DWERLLRRWE CIRAQQRQYI LAHPGCIPDR RERQRWLSRL YAARPRVLAA AEYLRGRQKG SHQKALSKI