Gene Csal_0144 details

Gene Information

Locus tag: Csal_0144
Symbol:
ID: 4027285
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 166260
End bp: 167510
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 62%
IMG OID: 637965295
Product: hypothetical protein
Protein accession: YP_572207
Protein GI: 92112279
COG category:
COG ID:
TIGRFAM ID:
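
The coordinate and length fields in this record are mutually consistent, which a short check makes explicit. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; the record states neither, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 166260
end_bp = 167510

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes one terminal stop codon,
# which is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1251
print(protein_length)  # 416
```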


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCCCG TTGCCATGAC ATCGACACCC ATTCTCGCAT GCTGGACTCT CTCGCTATTC 
GGCGCCGCAT CGATCGCCCA CGCGGCGCCT GCCGACGAGG CGACGCCTCC CGTCAGGGAA
GACGCTCCCA CGACACCGCA AACGGCCCAG AGCTGGGCCG ATGCCCTGGA ATTGGGTGGA
GCGCTGCGCT TCAATCATCG CTACGAGGAC TGGTCGTCGA GCGACAAGCA GCAAGGCGGT
GGCGATATCG ATTTCGATGC CTTCTACCTC GACCTGGAAG CGGAAAAGGA CGACCTCTTC
CTGGATCTTT CTTATTGGTT CAAGGACAAC GACGTCCGTG TCCTGGAGCA CGGCTTCTTC
GGCTACCGCT TTTCTTCCCG CTCGCGCCTG GAAATGGGGG CCACGTTCGA GCCGTTCGGT
ATCATGCCGT ATCCGCAATT CGGCTGGACC TTCAACATCC CCTTCTACCT GGGGATGGGT
CACAACACCG CGCTGGGGGC CAAATACGTC TACGAAGGTC CCGAGTGGGA GGCGCAGGTC
GGCTTCTTCA AGAACCCGCT GTCGCTGGAT ACGCGTTATG CGCCTAACAT GGCATCCGCC
GATGACGTCG ACGACGCCTT CCTTGCTCCC ACCAACAGCG GGCAAGCCAA CGAGAAGCAG
AATCAACTGA GTGGCCGCCT GGTGCGGACC TTCCAGGGCG ACGGATGGGA AAGTCGACTG
GGGGCCTCGG CATACGTCGG ACAGTTGCAC AACGACACGA CGGACCGTAA CGGCAGCTAC
TGGGGCACGG AACTGCACGC CTTGACGACC TTCGGTCCTT GGCAGGTGCA GCTCCAGGGC
ATTCGTTATG TCTTCGACCC CGAGAATCCC GAGGGCGTGA GCGACGACAG CGTGCTCTTT
GCCGGCCCGG GGACACCGAG TTACCGCGTG GCCGCGAAAG GGACCGTGGG GGTGCTCAAC
ATTGCCTACG ACTTGCCGAC GCCGCGCCTC GGCCCGGTCA AGAAGCTGCG TTTCTACAAC
GACTACAGTC GGCTGGTGAA AGACCGCAGT GGTTGGGACG ACTCGCAGAT GGAAACCGTC
GGTGTACAGT TCTTCGCCTT GCCGGTGATG GGGTGGCTGG ATGTCACCTG GGGCAAGAAC
ATGAACATGA TGGGCGGCAT GCCCGGCGGT GTCGGGCTGG CCTCGGCGGA TGCGGAGGGC
AGTGGCGAAT GGGAGCTACG CACCAACCTC AATATCGGCT ATTACTTCTA G
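
The GC content field can in principle be recomputed from the gene sequence. The helper below is a minimal sketch, not the method used by the source database: `gc_percent` is a hypothetical name, and it assumes the sequence is pasted as plain text in which spaces and line breaks between the 10-base groups should be ignored.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other non-nucleotide characters are skipped, so the
    10-base grouping used in this record can be pasted in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Toy input for illustration; paste the full gene sequence to compare
# against the GC content field of the record.
print(gc_percent("AT GC"))  # 50.0
```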
 
Protein sequence
MNPVAMTSTP ILACWTLSLF GAASIAHAAP ADEATPPVRE DAPTTPQTAQ SWADALELGG 
ALRFNHRYED WSSSDKQQGG GDIDFDAFYL DLEAEKDDLF LDLSYWFKDN DVRVLEHGFF
GYRFSSRSRL EMGATFEPFG IMPYPQFGWT FNIPFYLGMG HNTALGAKYV YEGPEWEAQV
GFFKNPLSLD TRYAPNMASA DDVDDAFLAP TNSGQANEKQ NQLSGRLVRT FQGDGWESRL
GASAYVGQLH NDTTDRNGSY WGTELHALTT FGPWQVQLQG IRYVFDPENP EGVSDDSVLF
AGPGTPSYRV AAKGTVGVLN IAYDLPTPRL GPVKKLRFYN DYSRLVKDRS GWDDSQMETV
GVQFFALPVM GWLDVTWGKN MNMMGGMPGG VGLASADAEG SGEWELRTNL NIGYYF
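
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, assuming the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA from its first base, stopping at the first stop codon."""
    dna = "".join(b for b in seq.upper() if b in "ACGT")  # drop spaces/newlines
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence from this record.
first_line = "ATGAACCCCG TTGCCATGAC ATCGACACCC ATTCTCGCAT GCTGGACTCT CTCGCTATTC"
last_line = "AGTGGCGAAT GGGAGCTACG CACCAACCTC AATATCGGCT ATTACTTCTA G"

print(translate(first_line))  # MNPVAMTSTPILACWTLSLF
print(translate(last_line))   # SGEWELRTNLNIGYYF
```

Both outputs match the start and end of the protein sequence listed in the record; translating the full gene sequence the same way can be used to check the remaining residues.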