Gene Csal_1121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1121 
Symbol 
ID4029147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1278388 
End bp1279917 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content57% 
IMG OID637966298 
Productpeptidase U34, dipeptidase 
Protein accessionYP_573176 
Protein GI92113248 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4690] Dipeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000424745 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGAC AAACACTCGC CACAGTATTG GCCTTAACGG TATTTGGCAC CTGCACGTCC 
GCGTGGGCGA GTCATGCGTT CTATGTTGGC AAGAATCTAA CGGAAAGCGG CAATGTGCTG
GTGGGCGGCA CCGGGGAAGA GGTATCGAGC CATTGGCTGG AAATCGTGCC TGCCGCTGAC
CATGAGCCCG GGGAGACGAT CGACGTTGGG GTGACCGAAG CCGCATGGAT CCCCGGCGAA
CTGATCGAAA TCCCTCAAGT CGAGCATACC TATCGCTACC TCTCGATGTC TTATTCAGAC
TATGAGGGCT TCCCGGCGCC GCTGACCAAT GGCGGTGTCA ATGAGCACCA AGTGGCGGTA
CGCGATGTAT GGGCCACGGG CCGCGAGGAG CTTATCGATG CCACCGAGAC GCCACAAAAG
GGCGTTCAGT ACAGCGACCT AGCCCGTCTG GTATTGGAAA GGGCCAAAAC CGCACGTGAA
GGGGTCAAGC TGATTGGCGA ACTGATCGAT GAGTATGGCT ATGCCACCTA CGGCGGCAAT
ACCCATTTGA TTGCCGATCC CGATGAGGCC TGGGTGGTCT GGGAACTGCC CGGTAGCCAG
GGACTTTGGG CCGCCGAACG CTTGGGGCCA GATGATGTAC GGGTGTTGTA TCCCGGTTAT
ATCGAAGATT TCCCACAGGA TTTCCAGAAC GACTCCAACT ATATGGGATC TGATAATCTC
GTTTCCTACG CGGAGGATAA AGGCTGGTTC GACGCGCAGG GCGATGAGTC TTTCAATATC
TTCGACGTCT ATGGACGTCA GGACACTCAA GCGCGTACCG GCGGCTACAA GTACATGAGC
CAGGCGGAGC TGGAGAAAGC CACGCGTGAC ATGGCGCCGG TCTCCGAGCA GGACATGATT
ACGCGGGTAC GGGATCATCG GATCTCCGAC GATGAAGCTG GTTATGGTCA GGTAGTCTCG
CTGGAACAAG GACGTGATCC CGACATGGTG CGTATTTGGG TGGCACCGAC TGGCTCGGTT
GCTGCGCCTT ACATCCCATG GTGGCTAGGC GTGCAGAAAG TTCCTGCTTC CTTCGCGCAG
CATCGCTACC TTACCAAGGG GGCAGGATCC CACTTCCTCA ATCCCGATTA TGCCATGCAG
GAAGCAAGCG ACTTCGCGGG TCGTCGCTTC AAGCAGGTCA TGTACTACAT GTGCGAAGAC
CCGGAAACCT TCCGCCCCAC CGTGCAGCGT ATGCTAAAGG GATTCGAGCA GGAGAGTTTC
GATGACATCC AATGGGTCGA GGAATCCGCA CGTACGCTCA TCGAACAAGG CAAGCGCGAG
CAGGCCCAGT CATTGTTGAC GTTCTATTCC TATACCCGCG CCAGCGATGC CATGAACTTG
GGAGATACCT TGGTGGATAG CCTGACGGCT TATAGCCAAC TGGTAACGGG TGAGCGTCTC
CCCAAAGGCG AACATATCAA CGACCAGGGC GGGGAGACCG TAAACTACTT GGTAGGTGCC
GATCCTGACA AGCCCCAATC AGCACAATAA
 
Protein sequence
MKRQTLATVL ALTVFGTCTS AWASHAFYVG KNLTESGNVL VGGTGEEVSS HWLEIVPAAD 
HEPGETIDVG VTEAAWIPGE LIEIPQVEHT YRYLSMSYSD YEGFPAPLTN GGVNEHQVAV
RDVWATGREE LIDATETPQK GVQYSDLARL VLERAKTARE GVKLIGELID EYGYATYGGN
THLIADPDEA WVVWELPGSQ GLWAAERLGP DDVRVLYPGY IEDFPQDFQN DSNYMGSDNL
VSYAEDKGWF DAQGDESFNI FDVYGRQDTQ ARTGGYKYMS QAELEKATRD MAPVSEQDMI
TRVRDHRISD DEAGYGQVVS LEQGRDPDMV RIWVAPTGSV AAPYIPWWLG VQKVPASFAQ
HRYLTKGAGS HFLNPDYAMQ EASDFAGRRF KQVMYYMCED PETFRPTVQR MLKGFEQESF
DDIQWVEESA RTLIEQGKRE QAQSLLTFYS YTRASDAMNL GDTLVDSLTA YSQLVTGERL
PKGEHINDQG GETVNYLVGA DPDKPQSAQ