Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1121 |
Symbol | |
ID | 4029147 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 1278388 |
End bp | 1279917 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637966298 |
Product | peptidase U34, dipeptidase |
Protein accession | YP_573176 |
Protein GI | 92113248 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4690] Dipeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000424745 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGAC AAACACTCGC CACAGTATTG GCCTTAACGG TATTTGGCAC CTGCACGTCC GCGTGGGCGA GTCATGCGTT CTATGTTGGC AAGAATCTAA CGGAAAGCGG CAATGTGCTG GTGGGCGGCA CCGGGGAAGA GGTATCGAGC CATTGGCTGG AAATCGTGCC TGCCGCTGAC CATGAGCCCG GGGAGACGAT CGACGTTGGG GTGACCGAAG CCGCATGGAT CCCCGGCGAA CTGATCGAAA TCCCTCAAGT CGAGCATACC TATCGCTACC TCTCGATGTC TTATTCAGAC TATGAGGGCT TCCCGGCGCC GCTGACCAAT GGCGGTGTCA ATGAGCACCA AGTGGCGGTA CGCGATGTAT GGGCCACGGG CCGCGAGGAG CTTATCGATG CCACCGAGAC GCCACAAAAG GGCGTTCAGT ACAGCGACCT AGCCCGTCTG GTATTGGAAA GGGCCAAAAC CGCACGTGAA GGGGTCAAGC TGATTGGCGA ACTGATCGAT GAGTATGGCT ATGCCACCTA CGGCGGCAAT ACCCATTTGA TTGCCGATCC CGATGAGGCC TGGGTGGTCT GGGAACTGCC CGGTAGCCAG GGACTTTGGG CCGCCGAACG CTTGGGGCCA GATGATGTAC GGGTGTTGTA TCCCGGTTAT ATCGAAGATT TCCCACAGGA TTTCCAGAAC GACTCCAACT ATATGGGATC TGATAATCTC GTTTCCTACG CGGAGGATAA AGGCTGGTTC GACGCGCAGG GCGATGAGTC TTTCAATATC TTCGACGTCT ATGGACGTCA GGACACTCAA GCGCGTACCG GCGGCTACAA GTACATGAGC CAGGCGGAGC TGGAGAAAGC CACGCGTGAC ATGGCGCCGG TCTCCGAGCA GGACATGATT ACGCGGGTAC GGGATCATCG GATCTCCGAC GATGAAGCTG GTTATGGTCA GGTAGTCTCG CTGGAACAAG GACGTGATCC CGACATGGTG CGTATTTGGG TGGCACCGAC TGGCTCGGTT GCTGCGCCTT ACATCCCATG GTGGCTAGGC GTGCAGAAAG TTCCTGCTTC CTTCGCGCAG CATCGCTACC TTACCAAGGG GGCAGGATCC CACTTCCTCA ATCCCGATTA TGCCATGCAG GAAGCAAGCG ACTTCGCGGG TCGTCGCTTC AAGCAGGTCA TGTACTACAT GTGCGAAGAC CCGGAAACCT TCCGCCCCAC CGTGCAGCGT ATGCTAAAGG GATTCGAGCA GGAGAGTTTC GATGACATCC AATGGGTCGA GGAATCCGCA CGTACGCTCA TCGAACAAGG CAAGCGCGAG CAGGCCCAGT CATTGTTGAC GTTCTATTCC TATACCCGCG CCAGCGATGC CATGAACTTG GGAGATACCT TGGTGGATAG CCTGACGGCT TATAGCCAAC TGGTAACGGG TGAGCGTCTC CCCAAAGGCG AACATATCAA CGACCAGGGC GGGGAGACCG TAAACTACTT GGTAGGTGCC GATCCTGACA AGCCCCAATC AGCACAATAA
|
Protein sequence | MKRQTLATVL ALTVFGTCTS AWASHAFYVG KNLTESGNVL VGGTGEEVSS HWLEIVPAAD HEPGETIDVG VTEAAWIPGE LIEIPQVEHT YRYLSMSYSD YEGFPAPLTN GGVNEHQVAV RDVWATGREE LIDATETPQK GVQYSDLARL VLERAKTARE GVKLIGELID EYGYATYGGN THLIADPDEA WVVWELPGSQ GLWAAERLGP DDVRVLYPGY IEDFPQDFQN DSNYMGSDNL VSYAEDKGWF DAQGDESFNI FDVYGRQDTQ ARTGGYKYMS QAELEKATRD MAPVSEQDMI TRVRDHRISD DEAGYGQVVS LEQGRDPDMV RIWVAPTGSV AAPYIPWWLG VQKVPASFAQ HRYLTKGAGS HFLNPDYAMQ EASDFAGRRF KQVMYYMCED PETFRPTVQR MLKGFEQESF DDIQWVEESA RTLIEQGKRE QAQSLLTFYS YTRASDAMNL GDTLVDSLTA YSQLVTGERL PKGEHINDQG GETVNYLVGA DPDKPQSAQ
|
| |