Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1520 |
Symbol | |
ID | 4029220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1731954 |
End bp | 1733252 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637966707 |
Product | putative aminopeptidase 2 |
Protein accession | YP_573572 |
Protein GI | 92113644 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1362] Aspartyl aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00169382 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCACG CCCCCACCCT TGACCGTTTA CTGCATTTTC TCGAGCGCTC GCCCACACCC TGGCATGCCG TCGACAACAT GGCTCGGCGG CTCGAACAGG CAGGGTATCG GCGACTCGAG GAAACCGAGG CGTGGCAATT GGCGCCCGGT GATCGTTTCT ACGTCACGCG CAACGCGTCG TCATTGATTG CCATGCAGGT GCCGACGGAC CCCTTGAGCG GGCTGCGCAT GATCGGGGCG CATACCGACA GCCCGGGGTT GCGGTTGAAA CCCCAACCCG TGGTGGCCAA GAAGGATTGG CTGCAGTTGA GCGTCGAGGT CTACGGCGGT GCGCTGCTGG CACCATGGTT CGACCGCGAT CTGGGGCTGG CCGGGCGCAT CCATGTACGA CGCGAGGATG GACGCTTGCA GGGCGTATTA TTGCATGTCG ATCGTCCCGT CGCGATCATT CCCAGCCTGG CCATCCACCT GGATCGCGAG GCCAACAACG GGCGTGCCCT GAATGCCCAG ACGCAGATGC TGCCGGTCGT GCTGCAAGGC GGTGGCGAAG CCGATCTCGA GCGCTGGCTC AAGCGCTGGC TGTACGAACA GCATGGGCTG GAGAACATTC AGTTGTTGGA TTACGAACTC TCGCTTTACG ACATGCAGCG GCCGTCGCGT GTCGGGATCG AGGGGGAACT GATCGCCAGT GCGCGCCTCG ACAATCTGCT GTCGTGCTTC ACCGGTATCG AGGCCTTGCT GGCCGGCGAC GGGCGACAGG GGGCGCTCTT CGTCGCCAAC GATCACGAAG AAGTGGGCAG TGCCAGTGCG TGCGGCGCCC AGGGCCCCTT CCTGGGAGAC GTGCTGCGTC GCGTGCATGC GCAACTGGGT GAGGGCGGCG AAGACGGCTG GGTGCGTCTG ATCCAGGGCT CGCGCATGAT TTCCTGCGAC AACGCCCATG CCGTGCACCC CAACTTTCCC GAGAAACACG ACGAACACCA CGGCCCGGCG ATCAATGGCG GGCCCGTGAT CAAGGTGAAC GCCAACCAGC GCTATGCCAC CAACAGCGCG ACAGCGGCCA TGTTCCGGGA TATCTGTCGC GAGGCAGGAA CGCCCGTGCA GACCTTCGTG ACGCGTGCCG ACATGGGCTG CGGCAGCACC ATCGGGCCCA TCACCGCCAC CGAACTCGGG GTGCCGACGC TGGATGTGGG TATCCCGCAG TGGGGCATGC ACTCGATTCG CGAAACCGCC GGCAGCCGCG ATGCCGATTA CTTGATCCGC GCGCTGACGG CCTTCGTCAA TCGCACCGAG CTGGACTAG
|
Protein sequence | MAHAPTLDRL LHFLERSPTP WHAVDNMARR LEQAGYRRLE ETEAWQLAPG DRFYVTRNAS SLIAMQVPTD PLSGLRMIGA HTDSPGLRLK PQPVVAKKDW LQLSVEVYGG ALLAPWFDRD LGLAGRIHVR REDGRLQGVL LHVDRPVAII PSLAIHLDRE ANNGRALNAQ TQMLPVVLQG GGEADLERWL KRWLYEQHGL ENIQLLDYEL SLYDMQRPSR VGIEGELIAS ARLDNLLSCF TGIEALLAGD GRQGALFVAN DHEEVGSASA CGAQGPFLGD VLRRVHAQLG EGGEDGWVRL IQGSRMISCD NAHAVHPNFP EKHDEHHGPA INGGPVIKVN ANQRYATNSA TAAMFRDICR EAGTPVQTFV TRADMGCGST IGPITATELG VPTLDVGIPQ WGMHSIRETA GSRDADYLIR ALTAFVNRTE LD
|
| |