Gene Csal_1520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1520 
Symbol 
ID4029220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1731954 
End bp1733252 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content64% 
IMG OID637966707 
Productputative aminopeptidase 2 
Protein accessionYP_573572 
Protein GI92113644 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1362] Aspartyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00169382 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCACG CCCCCACCCT TGACCGTTTA CTGCATTTTC TCGAGCGCTC GCCCACACCC 
TGGCATGCCG TCGACAACAT GGCTCGGCGG CTCGAACAGG CAGGGTATCG GCGACTCGAG
GAAACCGAGG CGTGGCAATT GGCGCCCGGT GATCGTTTCT ACGTCACGCG CAACGCGTCG
TCATTGATTG CCATGCAGGT GCCGACGGAC CCCTTGAGCG GGCTGCGCAT GATCGGGGCG
CATACCGACA GCCCGGGGTT GCGGTTGAAA CCCCAACCCG TGGTGGCCAA GAAGGATTGG
CTGCAGTTGA GCGTCGAGGT CTACGGCGGT GCGCTGCTGG CACCATGGTT CGACCGCGAT
CTGGGGCTGG CCGGGCGCAT CCATGTACGA CGCGAGGATG GACGCTTGCA GGGCGTATTA
TTGCATGTCG ATCGTCCCGT CGCGATCATT CCCAGCCTGG CCATCCACCT GGATCGCGAG
GCCAACAACG GGCGTGCCCT GAATGCCCAG ACGCAGATGC TGCCGGTCGT GCTGCAAGGC
GGTGGCGAAG CCGATCTCGA GCGCTGGCTC AAGCGCTGGC TGTACGAACA GCATGGGCTG
GAGAACATTC AGTTGTTGGA TTACGAACTC TCGCTTTACG ACATGCAGCG GCCGTCGCGT
GTCGGGATCG AGGGGGAACT GATCGCCAGT GCGCGCCTCG ACAATCTGCT GTCGTGCTTC
ACCGGTATCG AGGCCTTGCT GGCCGGCGAC GGGCGACAGG GGGCGCTCTT CGTCGCCAAC
GATCACGAAG AAGTGGGCAG TGCCAGTGCG TGCGGCGCCC AGGGCCCCTT CCTGGGAGAC
GTGCTGCGTC GCGTGCATGC GCAACTGGGT GAGGGCGGCG AAGACGGCTG GGTGCGTCTG
ATCCAGGGCT CGCGCATGAT TTCCTGCGAC AACGCCCATG CCGTGCACCC CAACTTTCCC
GAGAAACACG ACGAACACCA CGGCCCGGCG ATCAATGGCG GGCCCGTGAT CAAGGTGAAC
GCCAACCAGC GCTATGCCAC CAACAGCGCG ACAGCGGCCA TGTTCCGGGA TATCTGTCGC
GAGGCAGGAA CGCCCGTGCA GACCTTCGTG ACGCGTGCCG ACATGGGCTG CGGCAGCACC
ATCGGGCCCA TCACCGCCAC CGAACTCGGG GTGCCGACGC TGGATGTGGG TATCCCGCAG
TGGGGCATGC ACTCGATTCG CGAAACCGCC GGCAGCCGCG ATGCCGATTA CTTGATCCGC
GCGCTGACGG CCTTCGTCAA TCGCACCGAG CTGGACTAG
 
Protein sequence
MAHAPTLDRL LHFLERSPTP WHAVDNMARR LEQAGYRRLE ETEAWQLAPG DRFYVTRNAS 
SLIAMQVPTD PLSGLRMIGA HTDSPGLRLK PQPVVAKKDW LQLSVEVYGG ALLAPWFDRD
LGLAGRIHVR REDGRLQGVL LHVDRPVAII PSLAIHLDRE ANNGRALNAQ TQMLPVVLQG
GGEADLERWL KRWLYEQHGL ENIQLLDYEL SLYDMQRPSR VGIEGELIAS ARLDNLLSCF
TGIEALLAGD GRQGALFVAN DHEEVGSASA CGAQGPFLGD VLRRVHAQLG EGGEDGWVRL
IQGSRMISCD NAHAVHPNFP EKHDEHHGPA INGGPVIKVN ANQRYATNSA TAAMFRDICR
EAGTPVQTFV TRADMGCGST IGPITATELG VPTLDVGIPQ WGMHSIRETA GSRDADYLIR
ALTAFVNRTE LD