Gene Csal_1920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1920 
Symbol 
ID4028362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2180538 
End bp2181728 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content69% 
IMG OID637967114 
ProductMoeA-like protein 
Protein accessionYP_573971 
Protein GI92114043 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0303] Molybdopterin biosynthesis enzyme 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGAGC TGCAAAGCGT CGAACGTGCG CTGTCGGCAC TGCTGGACGG CGTGAGCCCG 
AGCGAGCCCG AATGGATCGA CTGCACGCAA GCCGCCGGCC GCGTCCTGGC GGAAACCGTC
ACCGCGCGTC TCGACGTGCC GCCGGCCGAT AACAGCGCCA TGGACGGCTA TGCGCTGCGC
CACGCCGATG CCGGCCAGGC CTTGCCGGTG AGCCAACGGG TCGCCGCCGG CCATGCCCCC
GAGCCGCTCG CCAGCGGTAC CTGCGCACGC ATCTTCACCG GCGGTGAAAT ACCGCCGGGC
GCCGACACGG TGGTCATGCA AGAACGCGTG ACGATGCATG AGCAGCTGGC CATCATTCCC
GACGATATCG AATGCGGTGA CAACGTGCGT CGCCGCGGGC GCGATGTACG AGCCAGCACC
CCCTTGCTGG AGGCCGGCAC ACGCCTGGAG GCCGCGGCGC TGGGGCATCT GGCCGGACAA
GGCATTACCC AGGTCGCCGT CCACCGCCGC CCTCGCATCG CCCTGCTTTC CACCGGGGAC
GAGATCGTCG AGCCCGGCCA GACGCTCGCA CCGGGACAGA TTCACAATTC GAATCGCCCC
ATGCTGTCGC GCCTGCTCGA GCGCTTCGGC GCCGACCTGG TCATGCGCGA ACACGTCGCC
GACGACTTCG CGACCACCCA GCGCCTGTTG GGCGACGCCG CCGCCTGTGC GGATATCATC
GTCACCACCG GTGGCGTCAG CGTCGGCGAG GAAGACCACG TCAAGCACGC CCTGGAATCC
CTGGGACGAC TCGACCTCTG GCAGCTGGCC ATGCGCCCTG GCAAGCCGCT GGCCTTGGGG
CGGATCGGCG AGACGCGGGT GGTGGGGCTC CCCGGCAATC CCGTGTCGTG CTTCGTGGGC
GCCTGGGTCT ATCTGCGCCC GCTGATCGGC GCCTTTCTTG CCTGTCCGCG CATGGCCACG
CTGCCGCGAC TCTGGGCGCG TGCCGACTTC GAGACGCGCA CCCAGGCGCG TCGCCACTAC
ATGCGCGTGG GGCTCGAGTT CACACCCGAG GGCGTCATCG CGCATGCCTT CGCCGATCAG
AACTCCGCCG TGCTGTCCTC GTGCCTCGAG GCGGATGCCC TGGCCGTCAT CCCCGAGCAC
ACGACCGTGC GAGCAGGCGA GCAGGTGGAA TGTCTATGGC TCACGGAGTG A
 
Protein sequence
MAELQSVERA LSALLDGVSP SEPEWIDCTQ AAGRVLAETV TARLDVPPAD NSAMDGYALR 
HADAGQALPV SQRVAAGHAP EPLASGTCAR IFTGGEIPPG ADTVVMQERV TMHEQLAIIP
DDIECGDNVR RRGRDVRAST PLLEAGTRLE AAALGHLAGQ GITQVAVHRR PRIALLSTGD
EIVEPGQTLA PGQIHNSNRP MLSRLLERFG ADLVMREHVA DDFATTQRLL GDAAACADII
VTTGGVSVGE EDHVKHALES LGRLDLWQLA MRPGKPLALG RIGETRVVGL PGNPVSCFVG
AWVYLRPLIG AFLACPRMAT LPRLWARADF ETRTQARRHY MRVGLEFTPE GVIAHAFADQ
NSAVLSSCLE ADALAVIPEH TTVRAGEQVE CLWLTE