Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0894 |
Symbol | |
ID | 5055979 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 792770 |
End bp | 794548 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640468454 |
Product | carbon starvation protein CstA |
Protein accession | YP_001153130 |
Protein GI | 145591128 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG1966] Carbon starvation protein, predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.355401 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAATA CCCCAGGTCC GTACATATTA ATTGGGCTTG TGGCTTATTT TTTGACGTAT CTATTCTACG CTAGGTGGGT TGACAAAAAG ATATGGGAAA CCGATCCTAA CAGGCCGACG CCTGCTAGGC TATACTTCGA CGGCGTAGAG TACTTCCCCG TGTCTAAATA CGTGCTATTT GGATACCAGT TTAAATCAGT GGCGGCGCTG GGGCCCATAG TAGGCCCCCT CACCGCAGTG CTCTTCTTCG GCTGGGTGCC GGCGTTGCTG TGGGTAATCC TGGGCAACAT GTTCATCGGC TGGGTACAAG ACTACAGCGC AATGATGATG TCGCTGAGAA ACGAGGGGAG GTCGATGGGC CCCATAACCT ATAAGCTACT GGGCGATAGA GCGAGAAAAA TCTTGTTGAT ATACCTCATA TTCTACCTAA TAATTATAAC CGCAGTTTTC GAGTGGGTTA TCATTGACGT GTTGAACAGA GTGCCGGGGA CCTTCACCGC TGTGCTCTTT GTCCTCCTCG GCGGCGTTGT GTTCGGCGCG TTAGTTTTCC GCATGAGAAT GGATGTCCTC ATCGCCACTG TGGTGGCTCT CGGCATTGTG CTCGTGGGCT ACTTCCTTGT GACGCTTGTG CCGGCAGTCA GGGCGCCGGG GACCAACTTC CTTGACCCAC AGGACTTCTT CAAAGCACAC AACTTCAACC CCTCGCTTAC CTATCCCGGT ACAAACACCG TCCTCTTCTG GCTATTAGTC CTATCGGTGC TTTACTACAT TGCGGCAGTG ACGCCGATGC CTAGGTTCCT CCTGCCCACG GTCTACGTGG GCTATCTGCC CAGCATCATA GCGCTTGTTC TAGTGCTAAT AGCCGCAATA TTCACCCCGC TTACCGGACT GACCATACAG CAGACGCCAA TGAAAGCCCT TTACGTAGAT CCCCTACAGA ACGCGCAAGG CGGGCCTCTC TGGCCAATCC TATTTGTAAC CATTGCGTGT GGAGCCATAT CGGGGTGGCA TAGTCTTGTC TCCTCTGGGC TGACTCCCAA GCAGCTCGAG TACGAGACGG ATGCCCTACC TGTTGGGGGC GGCGCGATGA TGACCGAGGG CGCCGTAGCT CTCTCCTCCA TAGCCGCCGT CATGGTGCTT TCGCAACCGC CTGCGGGCGC CGCGGCGTAT GTACAAGGTG CAACGCTCCT TACAACTAAG CTACTACAGG TTCCCGACGT ATATATGAAC ATTCTCTACG GCATATTCGT AACTGTGATG GGTCTCATTA CCTCAATGCT CTTCGTAAGG GTCTTCCGCC TCATCATGGC AGAGCTTTTC GAGGAGAGCC CCCTGGGCAA CAAGTTCATA TCACCAATTC TAATCCTAAT AATAGCAGGC TTTCTTGCCT TTGTCGGAAG CTGGACCAAC CTCTGGATCT TCTTCGGCGG CACCAACCAG CTTCTCGCCG GGCTGGCGCT TCTGTTAGTC GCCATATTCC TAGCCAGCGT GAAGAAGCCC ACTGCCTACG TTTTCATACC GGGTATCTTC ATGGCTATCA CAACGCTGGC GGCGCTGGCA TGGGAGACCT ACGTCTACGG CCTCTACGCC GCGATGAATA AGCCGATAGG CGTGCAGGCC GCGGCTGCTG CCCTATACGG GAACTGGATT GTCTCTGTGT CGAACTACAT CTCGGCGGCC TTTGGAGCGC TACTCTTAAT CCTGGGCGCA ATAACGACCT ATTACTTAAT CACCGGATGG ATCAAGTATA GACGGGGCGA CACAAAAGTA TTTAAATAA
|
Protein sequence | MLNTPGPYIL IGLVAYFLTY LFYARWVDKK IWETDPNRPT PARLYFDGVE YFPVSKYVLF GYQFKSVAAL GPIVGPLTAV LFFGWVPALL WVILGNMFIG WVQDYSAMMM SLRNEGRSMG PITYKLLGDR ARKILLIYLI FYLIIITAVF EWVIIDVLNR VPGTFTAVLF VLLGGVVFGA LVFRMRMDVL IATVVALGIV LVGYFLVTLV PAVRAPGTNF LDPQDFFKAH NFNPSLTYPG TNTVLFWLLV LSVLYYIAAV TPMPRFLLPT VYVGYLPSII ALVLVLIAAI FTPLTGLTIQ QTPMKALYVD PLQNAQGGPL WPILFVTIAC GAISGWHSLV SSGLTPKQLE YETDALPVGG GAMMTEGAVA LSSIAAVMVL SQPPAGAAAY VQGATLLTTK LLQVPDVYMN ILYGIFVTVM GLITSMLFVR VFRLIMAELF EESPLGNKFI SPILILIIAG FLAFVGSWTN LWIFFGGTNQ LLAGLALLLV AIFLASVKKP TAYVFIPGIF MAITTLAALA WETYVYGLYA AMNKPIGVQA AAAALYGNWI VSVSNYISAA FGALLLILGA ITTYYLITGW IKYRRGDTKV FK
|
| |