Gene Csal_1400 details


Gene Information

Locus tag: Csal_1400
Symbol:
ID: 4029064
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 1588487
End bp: 1589557
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 63%
IMG OID: 637966585
Product: Phage portal protein, PBSX
Protein accession: YP_573454
Protein GI: 92113526
COG category: [R] General function prediction only
COG ID: [COG5518] Bacteriophage capsid portal protein
TIGRFAM ID: [TIGR01540] phage portal protein, PBSX family
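
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1588487, 1589557)
print(length, protein_length(length))
```

With the values from this record, the sketch gives 1071 bp and 356 aa, which agrees with the listed Gene Length and Protein Length.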


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.691586
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCACGG CCGACAAACC CCGCATGCGT GTGCCGGCGA CCCTGTCCGA GTCCGAACCG 
GCGGCGGAGG CGTCGCCAGC CCCGGCGCGG GCCGAGGCGT TCACCTTCGG CGAGCCGGTA
CCGGTGACGG ACCTGGCCGA TTTCCTCTAC ACCGGCTGTT GGATGCTGAC GGCGCGCTGG
TACGAACCGC CGGTGGATCT GCCAGCGCTG GCCAAGGTGT ATCGCGCGAC GGCACATCAC
GGTTCCAGTT TGCAGGTGAA GCGCAACATT CTATCGCGAT CGTTCATTCC GCATCGCCTA
CTGAGCCGGC AGGCGTTCCG TGCCCTGGTC ACGGATTACC TGGTGTTTGG CAATGCGTAC
ATCGAGCGGG TGTATGGGCG CCTTGGGCGG CTGCTGGCAT TGCGGCCGGC GCGAGCCAAG
TACGTGCGCC GAGGCGTCGA AGAAGGGCAG TACTGGTGGG TGACATCCTG GCAGGTGGCC
AGCGAATTCG AGCGCGATTC GGTGATCCAC CTGATGGAGC CTGACATCAA CCAAGAGATC
TACGGCGTGC CGGACTACCT CGGCGCGCTG CAGTCGATCC TACTCAACGA AAACGCTACC
TTGTTCCGCC GCAAGTACTA CCTGAACGGT AGTCATGCCG GCTTTGTGAT GTACGTCTCC
GATACCGCGC AGAACCAGGA AGACATCGAT GCCATGCGCG AGGCCCTGCG CAACTCCAAG
GGCGTGGGCA ACTTCCGCAA CCTGTTCCTT CACTCGCCGG GCGGCAAGAA GGATGGCGTA
CAGATCATCC CCATCAGCGA GGTCGCCGCC AAGGACGAAT TCGCCGGCAT CAAGCAGGAA
ACCCGCGACG ACACCCTCGC CAGCCACCGC GTACCGCCCC AGCTGATGGG CGTGATGCCC
AACAATGTCG GAGGATTCGG CGACGTGGAG AAGGCGGCCA GGGTCTTCGT CACCAACGAG
CTCGAGCCGC TGCAAGCCGT CTTCGAAGAG GTCAACGACC TGGTGGGGGA GCAGGTGATC
CGGTTTCGGC CTTACACACT TGAGGCGGCC AGCGAATCGC CAACCGGATA A
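
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. As a rough sketch, GC content can be estimated as the fraction of G and C bases in the cleaned sequence; the helper names below are hypothetical, and the exact method the database used for its GC content field is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence only; paste the full sequence to
# compare against the GC content reported for the gene.
first_line = "ATGACCACGG CCGACAAACC CCGCATGCGT GTGCCGGCGA CCCTGTCCGA GTCCGAACCG"
print(f"{gc_content(first_line):.1%}")
```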
 
Protein sequence
MTTADKPRMR VPATLSESEP AAEASPAPAR AEAFTFGEPV PVTDLADFLY TGCWMLTARW 
YEPPVDLPAL AKVYRATAHH GSSLQVKRNI LSRSFIPHRL LSRQAFRALV TDYLVFGNAY
IERVYGRLGR LLALRPARAK YVRRGVEEGQ YWWVTSWQVA SEFERDSVIH LMEPDINQEI
YGVPDYLGAL QSILLNENAT LFRRKYYLNG SHAGFVMYVS DTAQNQEDID AMREALRNSK
GVGNFRNLFL HSPGGKKDGV QIIPISEVAA KDEFAGIKQE TRDDTLASHR VPPQLMGVMP
NNVGGFGDVE KAARVFVTNE LEPLQAVFEE VNDLVGEQVI RFRPYTLEAA SESPTG
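
The protein sequence follows from the gene sequence by codon-wise translation. Below is a minimal sketch, assuming the amino-acid assignments of NCBI translation table 11 (which are the same as the standard code for all 64 codons) and ignoring the special handling of alternative start codons; `translate` is an illustrative helper, not a library function.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the amino acids of
# NCBI translation table 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGACCACGG CCGACAAACC CCGCATGCGT"))  # MTTADKPRMR
```

The result for the first 30 bases matches the first ten residues of the listed protein sequence; translating the final twelve bases (`CCAACCGGATAA`) gives `PTG` followed by a stop, which matches the end of the protein.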