Gene Information |
Locus tag | Pars_1423 |
Symbol | |
ID | 5055966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1283732 |
End bp | 1284946 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640468964 |
Product | citrate transporter |
Protein accession | YP_001153633 |
Protein GI | 145591631 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1055] Na+/H+ antiporter NhaD and related arsenite permeases |
TIGRFAM ID | |
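The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 1283732
end_bp = 1284946

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is made of whole codons; the last one is assumed to be the stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(f"gene length: {gene_length} bp")
print(f"leftover bases after splitting into codons: {remainder}")
print(f"protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.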
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.418451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0399282 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGGCCA GGCCGCTGTA CCCCAAGCTA CCCACGTGGT CTCTTATGTC CCTCGCCGCT TTTATAGCGG TGTTTTTTGG GCCCCTGGGC GTAGACGACG TGCCGCGGGT TGTAAACTTC GAGGTGCTGC TCTTCCTAAT AGGCATGTTC TCCATAGTCG CCTTGGCCGA GTCAAGCGGG TTGTTGGACG CCTTCGCCTA CTGGTTCGTC TCGCTACTCA GATCAAGGCT GTCCATATTC GTCGGGAGCT CGTTGCTCTT CGGCCTCCTC TCGGCCATAG CCGTAAACGA CACAGTGGCC CTCATGGGCC CCCCACTCGC CGCCGCAGTG GCTAGAGCCG CAGGCATTGA GTACAGGCAC ATGTTCCTCC TCCTGGCCTT CTCGCTGACC ATAGGCTCCG TGATGACGCC CATAGGCAAC CCCCAGAACA TGCTAATCGC AGTGGAGTCC GGCATGGCTA CGCCGTTTAT CACCTTCCTC CGCCACCTCG CCATCCCCAC GCTGATAAAC CTCGTGGCCA CCCCCCTCCT CCTGTTTAAG CTATTCGGCA TAAAGAACGA GAAAGTGCGC TACGTCGCTG TGGCGCGGGA GCACATAAGG AACAGACGCG ACGCCGCGGT GGCCGCGGTG GTGTTGTTAG GCACCGTAGC CGCAATACTG GCCAACGACC TCGCCGCGCT CTCGGGGCAC CCCCACATCA AGAACATCGG CTTCATCCCC TTCGTCGCCG CGTCTCTGCT CTACTTCTTC GCCACCTCGC CCAGGGAGGT TCTGGCCAAG GTCGAGTGGG GCACAATCAT CTTCTTTATC GCGATGTTTA TTACAATGGC TACTATCTGG CATGGCGGCG TGTTGCAGCC CCTCACCTCC GCCTTGCTCC CCAGCTACTC CGGCTCTGCC CTGGATCTTT TGGCCATCAC GGCCTTGTCC ATTGCGCTGA GCCAAGTTCT GAGCAACGTG CCTTTTGTGA GCTTGTTCTC CACGTATCTC CACGAGCTGG GAGTCGCAGA CCCTAAGGCT TGGGTCGCCC TCGCCATGGC CTCCACAATT GCCGGCAATC TCACCCTCCT AGGCGCCGCC TCAAATATCA TAATCCTGGA GGTGCTCGAA ACCAGGTTCG GCGCCACCAT AACTTTCCTC CAATTCTTGA AATACGGCGC CTTAGTAACC GCCCTAAACC TCGCCGTCTA CCTACCGTTC CTCCTACTTG CATGA
|
Protein sequence | MLARPLYPKL PTWSLMSLAA FIAVFFGPLG VDDVPRVVNF EVLLFLIGMF SIVALAESSG LLDAFAYWFV SLLRSRLSIF VGSSLLFGLL SAIAVNDTVA LMGPPLAAAV ARAAGIEYRH MFLLLAFSLT IGSVMTPIGN PQNMLIAVES GMATPFITFL RHLAIPTLIN LVATPLLLFK LFGIKNEKVR YVAVAREHIR NRRDAAVAAV VLLGTVAAIL ANDLAALSGH PHIKNIGFIP FVAASLLYFF ATSPREVLAK VEWGTIIFFI AMFITMATIW HGGVLQPLTS ALLPSYSGSA LDLLAITALS IALSQVLSNV PFVSLFSTYL HELGVADPKA WVALAMASTI AGNLTLLGAA SNIIILEVLE TRFGATITFL QFLKYGALVT ALNLAVYLPF LLLA
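The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. Although the gene lies on the minus strand, the gene sequence as displayed already reads in coding orientation (it begins with ATG and ends with the stop codon TGA), so no reverse complement is applied here. The codon table is built from the compact NCBI layout (bases in T, C, A, G order); translation table 11 uses the same amino acid assignments as the standard code and differs only in its permitted start codons. The function name `translate` and the constants are hypothetical helpers, not part of the record.

```python
from itertools import product

# Codon table in the compact NCBI layout: codons enumerated with bases in
# T, C, A, G order, paired with the one-letter amino acid string.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in the record) is ignored.
    Alternative start codons are not special-cased in this sketch; that does not
    matter for this gene because it starts with ATG.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence shown above.
first_bases = "ATGTTGGCCA GGCCGCTGTA CCCCAAGCTA"
last_bases = "CTCCTACTTG CATGA"

print(translate(first_bases))  # MLARPLYPKL
print(translate(last_bases))   # LLLA
```

The two printed fragments match the first ten and last four residues of the protein sequence in the record. Passing the full gene sequence to `translate` would apply the same check across the whole protein.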