Gene Csal_1384 details

Gene Information

Locus tag: Csal_1384
Symbol:
ID: 4027936
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 1576923
End bp: 1578437
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 65%
IMG OID: 637966569
Product: Phage-related tail fibre protein-like protein
Protein accession: YP_573438
Protein GI: 92113510
COG category: [R] General function prediction only
COG ID: [COG5301] Phage-related tail fibre protein
TIGRFAM ID:
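
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1576923
END_BP = 1578437
GENE_LENGTH_BP = 1515
PROTEIN_LENGTH_AA = 504


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks should agree with the record if the assumptions hold.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 1576923 to 1578437 covers 1515 bp, and 1515 / 3 gives 505 codons, one of which is the stop codon, leaving 504 amino acids.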


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.44271
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGAAGT TCTATACCGT GCCGACCGCC GTCGGCGAAG CCAAGATCGC CAATGCCATT 
GCCCTCGGCA GGACCCTCAC CATCGCCGAG CTCGCCATCG GCGACGGCAA TGGCTCCCTT
CCCAACCCGG ATAGCGACCG CACGTCGCTG GTCAACCAGG TGCGCCGCGC GCCCATCAAC
ACCAGCGTCG TGGACGACGA CAATCCGAAT TGGATCGTAG TCGAGCAAGT GATCCCGCCG
GACGAAGGCG GATGGACGAT TCGTGAGATC GGCCTGTTCG ACTCGGACGG CGACATGATC
GCCTACGGCA ACTACCCCGA GACCTACAAG CCCGTGCTCT CGGAAGGCTC CGGCCGCACA
CAGACGATCC GGTTTGTGAT GCAGGTCTCC GACACCGCCG CCGTGACGCT GAAGGTGGAT
CCCTCCATCG TGCTGGCGAC GCGCAAGCTC GTGGACGACA AGATCGACGA GCACGCCCAA
AGCCGGGATC ACCCGGATGC CACGACGACG CAACGCGGCT TCGTCAAGAA AGCGACCTGG
GACGCGACGC GCCAGCGCTC GAGTACCGCC TCGGTGGTCA CGCCAGGCGG CCTGGATGGC
GCCATGGCCG ACCACGAGAA CGCCTCGACG GCACACCGCG CCTCGCAAAT CGCGCTCGAC
AAGGCCCTCG ACGTCTTCGG CGATGCCGAT ACCGTCCAGG CGGTGCTGGC CCTCCTGGGC
GCGGCGGCCA AGGCCGCGAA CCACAACGAC ATTGACGGCC GTGACAGTTC AGACGCGCAT
CCCATGGGTG CCATTTCCGG GCTGAATGCT GCTCTCGAGG CACTCGACCA GGCGATCGGC
ACCAAGCCCG ACCGCGACCA GGTCGTCCGG GCGGACAAGC CGTCGCAAAT CGACGTCGGC
CGTGGCGTGG TGATGATCGA AAACCACGAC CAGGACAACG CGGACGGCGC CGGTCTCACC
TTCCGCACGA CCGACAATCC CGGCGACGGA TCACCGGAAA ACGTGGGGGC GATCCTTGCC
ATCCGCTCCA GCGGTGACGC TCTGCGTCTA TGGGTCGGCC AGTCGGTGAC CTCGACCGGC
GATAACGACT TCGAGACCCG CAACCTCAAG GCATCCGGCA CCATCAGCGG CAACGGATCG
GGTCTGAACA GGGTCAACGC CGACACGGTC GATGGCTGGC ACCGGGACCG TATCCGCCAA
TGGGGCAATA TCACCGGCAA ACCGGCCACG GCAACAAGAT GGCCGCGATG GAGCGAAGTC
AGTGGAAAGC CGGACCTCAC CCAAAACACC GGTTCGGGGC AAGTCGGCAC ATACGGCATG
TTCGTGGCGC GCGGCGGCGC GACTACTCCA GGTCATACAA CATCAGGTAG TGCCCTGCGC
TGGTCGAACT GTGCGGGCAA CAGGAATAAC GGCCGCGCCC CTTCGGGTAC CTGGCGCCTG
ATGGGCTCCT TGGCTGAAGG CGACCGGGAT GCGGAATCCT CGAATTCGAC CGCGATTTAT
CTAAGGATTG CATGA
 
Protein sequence
MAKFYTVPTA VGEAKIANAI ALGRTLTIAE LAIGDGNGSL PNPDSDRTSL VNQVRRAPIN 
TSVVDDDNPN WIVVEQVIPP DEGGWTIREI GLFDSDGDMI AYGNYPETYK PVLSEGSGRT
QTIRFVMQVS DTAAVTLKVD PSIVLATRKL VDDKIDEHAQ SRDHPDATTT QRGFVKKATW
DATRQRSSTA SVVTPGGLDG AMADHENAST AHRASQIALD KALDVFGDAD TVQAVLALLG
AAAKAANHND IDGRDSSDAH PMGAISGLNA ALEALDQAIG TKPDRDQVVR ADKPSQIDVG
RGVVMIENHD QDNADGAGLT FRTTDNPGDG SPENVGAILA IRSSGDALRL WVGQSVTSTG
DNDFETRNLK ASGTISGNGS GLNRVNADTV DGWHRDRIRQ WGNITGKPAT ATRWPRWSEV
SGKPDLTQNT GSGQVGTYGM FVARGGATTP GHTTSGSALR WSNCAGNRNN GRAPSGTWRL
MGSLAEGDRD AESSNSTAIY LRIA
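
The gene and protein sequences above are laid out in blocks of ten characters, so they need to be cleaned before use. The following is a minimal sketch of how the gene sequence could be translated and its GC content computed. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard code for elongation codons, and it does not model alternative start codons. The helper names `clean`, `translate`, and `gc_percent` are hypothetical and not part of any database tooling.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence listed in this record.
first_blocks = clean("ATGGCGAAGT TCTATACCGT GCCGACCGCC")
print(translate(first_blocks))
```

Passing the full gene sequence through `clean` and then `translate` should give a string that can be compared with the cleaned protein sequence, and `gc_percent` can be compared with the reported GC content.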